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(54) METHOD AND DEVICE FOR MULTI-ANGLE CONNECTING AND ENCODING BIT STREAM 



(57) A method and device for interleaving bit stream 
by which multimedia data including digital picture data, 
audio data, and sub-video signals are seamless-repro- 
duced by smoothly switching videos and voices to each 
other without disturbing videos, allowing noise to be 
contained in voices, and discontinuing voices at angle 
switching sections during multi-angle reproduction from 
an optical disk on which multi-media data are recorded. 
A multi-angle system stream is constituted of a plurality 
of system streams composed of picture data and audio 

Fig.2 



data created at different viewing points. In the multi- 
angle system stream from which a system stream corre- 
sponding to an angle can be reproduced at every prede- 
termined unit by freely switching the system stream to 
another during reproduction, the display time of picture 
data contained in the system stream corresponding to a 
angle and the display time of audio data are made equal 
to each other for every angle at every prescribed unit at 
which the angle can be switched. 



100 



Scenario St£ 



3JPJ- 



!3P_C 



Sub picture 



-j 400 



Video 
stream 



is 



M 



1S» 1»0 



CL 
LU 



Primed by Xerox (UK) Business Services 
2.16.3/3.4 



EP 0 847 200 A1 



Description 

Technical Field 

5 The present invention relates to a method and apparatus for interleaving a bitstream for use in an authoring system 

for variously processing a data bitstream comprising the video data, audio data, and sub-picture data constituting each 
of plural program titles containing related video data, audio data, and sub-picture data content to generate a bitstream 
from which a new title containing the content desired by the user can be reproduced, and efficiently recording and 
reproducing said generated bitstream using a particular recording medium. 

10 

Background Art 

Authoring systems used to produce program titles comprising related video data, audio data, and sub-picture data 
by digitally processing, for example, multimedia data comprising video, audio, and sub-picture data recorded to laser 
is disk or video CD formats are currently available. 

Systems using Video-CDs in particular are able to record video data to a CD format disk, which was originally 
designed with an approximately 600 MB recording capacity for storing digital audio data only, by using such high effi- 
ciency video compression techniques as MPEG. As a result of the increased effective recording capacity achieved 
using data compression techniques, karaoke titles and other conventional laser disk applications are gradually being 
20 transferred to the video CD format. 

Users today expect both sophisticated title content and high reproduction quality. To meet these expectations, each 
title must be composed from bitstreams with an increasingly deep hierarchical structure. The data size of multimedia 
titles written with bitstreams having such deep hierarchical structures, however, is ten or more times greater than the 
data size of less complex titles. The need to edit small image (title) details also makes it necessary to process and con- 
25 trol the bitstream using low order hierarchical data units. 

It is therefore necessary to develop and prove a bitstream structure and an advanced digital processing method 
including both recording and reproduction capabilities whereby a large volume, multiple level hierarchical digital bit- 
stream can be efficiently controlled at each level of the hierarchy. Also needed are an apparatus for executing this digital 
processing method, and a recording media to which the bitstream digitally processed by said apparatus can be effi- 
30 ciently recorded for storage and from which said recorded information can be quickly reproduced. 

Means of increasing the storage capacity of conventional optical disks have been widely researched to address the 
recording medium aspect of this problem. One way to increase the storage capacity of the optical disk is to reduce the 
spot diameter D of the optical (laser) beam. If the wavelength of the laser beam is I and the aperture of the objective 
lens is NA, then the spot diameter D is proportional to I/NA, and the storage capacity can be efficiently improved by 
35 decreasing I and increasing NA. 

As described, for example, in United States Patent 5,235,581 , however, coma caused by a relative tilt between the 
disk surface and the optical axis of the laser beam (hereafter "tilt") increases when a large aperture (high NA) lens is 
used. To prevent tilt-induced coma, the transparent substrate must be made very thin. The problem is that the mechan- 
ical strength of the disk is low when the transparent substrate is very thin. 
40 MPEG1, the conventional method of recording and reproducing video, audio, and graphic signal data, has also 
been replaced by the more robust MPEG2 method, which can transfer large data volumes at a higher rate. It should be 
noted that the compression method and data format of the MPEG2 standard differ somewhat from those of MPEG1 . 
The specific content of and differences between MPEG1 and MPEG2 are described in detail in the ISO-1 1 172 and ISO- 
13818 MPEG standards, and further description thereof is omitted below. 
45 Note, however, that while the structure of the encoded video stream is defined in the MPEG2 specification, the hier- 
archical structure of the system stream and the method of processing lower hierarchical levels are not defined. 

As described above, it is therefore not possible in a conventional authoring system to process a large data stream 
containing sufficient information to satisfy many different user requirements. Moreover, even if such a processing 
method were available, the processed data recorded thereto cannot be repeatedly used to reduce data redundancy 
so because there is no large capacity recording medium currently available that can efficiently record and reproduce high 
volume bitstreams such as described above. 

More specifically, particular significant hardware and software requirements must be satisfied in order to process a 
bitstream using a data unit smaller than the title. These specific hardware requirements include significantly increasing 
the storage capacity of the recording medium and increasing the speed of digital processing; software requirements 
55 include inventing an advanced digital processing method including a sophisticated data structure. 

Therefore, the object of the present invention is to provide an effective authoring system for controlling a multimedia 
data bitstream with advanced hardware and software requirements using a data unit smaller than the title to better - 
address advanced user requirements. 
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scene data common to plural titles and the <*Z£*™*££ ^eW Sected and reproduced is desirable, 
taining plural scenes unique to particular re P r ^ on ^ S a ^.„ %ZS**» period are arranged on the same 
However, when plura. scenes unique to a re P[<^^^^ Setore unavoidabty inserted between 

s^rro^^ 

see" data is that reproduction is scene data. i.e.. scene data showing 

A further problem can be expected when the J****** *™ " broadcast, this multi-angle scene data 
substantially the same subject from ^S^£ «merSTn different locations. 7*e probf em is thus 

may be obtained by «^ ™S«£ ieTes during data presentation, data from different angles ,s con- 

that when the user selects one of these plural angie^ scenes _ « * expected. 

nected where the angle changes, and a natural. ^^^^^^^ natural, seamless reproduction 
Therefore, the object of the present irwention .s *^ a method for generating a sys- 

without scene data intermitting can be ^^^^^^^Tspp^os for recording and reproduc- 
tem stream having said data structure; a recording «g*^J££^ r Srded and from which said system 

===5SESS5sr 5£r 

29. ^i lheTnt'e contents of which are expressly incorporated by reference here,n. 
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Thepresent invention - been developed w«h a view ~ 
and has for its essentia, object to ^^^^^S^SSSS wSh for the presentation of a bitstream 

in order to achieve the aforementioned ob ective ^^^mwmprising M or ^ data units contiguous 
that is reproduced by selecting two or more date unrts ^£^£2^ »• selected data units in a particular 
on the same time-base is characterized by 9^^^^ *J date unit so that it is possible to sequentially 
sequence on the same time^ase based on ^^°^^^,a 8e irrt ermittence. and is characterized by 
access all data units and present only the selected date bitstream by ranging the shortest-read- 

dividing said data units into shortest-read-time date ^ a j£^ 

time date unrts in a particular sequence on the sa^e time ^ b * s ^ o un rts and present only the shortest- 



read-time 
read-time data unit being the same. 
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Brief Description of Drawings 

2. 4 1 . sidTvtew ol an ophea, disk stonng the muM media brt stream o, F,g. 1 , 
F» 5 s ar, enlarged view ehowteg a portion eontined b, a oirde ol Fig. 4. 
%l is an Sed view showing a portion confined b, ■* »*0lF,g.5. 
Fto7isasidevie«sh(^r«avariaionoftFiec 1 *cald.akolF.94 

Fig! 8 is e aide »iew showing another ^^J^^^^Zl^^,^^^^^. 

F 4»rewS^ 

5. ,', k a dwm view schema** showing one examp4e o. a heck pat pahern termed on me optica, disK o. 

5 « is a dfcgona, view schema one erWe o. a «ft pam pahern termed on me optica, d* o, 

5. ?4 * a P-an view showing anome- e*»*,e o, heck p«h termed on ere reoprding sodace o„he cphca, disk 0, 
Z % ,s a graph showing me aodio waves resutong horn me eodio daU recorded te Ihe m*an„,e date witin 
multi-angle scene period. 
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Fig. 16 is a graph schematically showing the structure of multimedia bit stream for use in Digital Video Disk system 
according to the present invention, 

Fig. 17 is a graph schematically showing the encoded video stream according to the present invention, 
Fig. 18 is a graph schematically showing an internal structure of a video zone of Fig. 16. 
5 Fig. 19 is a graph schematically showing the stream management information according to the present invention, 

Fig. 20 is a graph schematically showing the structure the navigation pack NV of Fig. 17, 

Fig. 21 is a graph is assistance of explaining a concept of parental lock playback control according to the present 
invention, 

Fig. 22 is a graph schematically showing the data structure used in a digital video disk system according to the 
10 present invention, 

Fig. 23 is a graph in assistance of explaining a concept of Mult-angle scene control according to the present inven- 
tion, 

Fig. 24 is a graph in assistance of explaining a concept of multi scene data connection, 
Fig. 25 is a block diagram showing a DVD encoder according to the present invention, 
15 Fig. 26 is a block diagram showing a DVD decoder according to the present invention, 

Fig. 27 is a graph schematically showing an encoding information table generated by the encoding system control- 
ler of Fig. 25, 

Fig. 28 is a graph schematically showing an encoding information tables, 

Fig. 29 is a graph schematically showing an encoding parameters used by the video encoder of Fig. 25, 
20 Fig. 30 is a graph schematically showing an example of the contents of the program chain information according to 
the present invention, 

Fig. 31 is a graph schematically showing another example of the contents of the program chain information accord- 
ing to the present invention, 

Fig. 32 is a graph schematically showing a relationship between the accumulated data quantity in the stream buffer 
25 and the operating mode of the reading head of Fig. 26, 

Fig. 33 is a graph in assistance of explaining a concept of sharing data between plural titles, 

Fig. 34 is a graph in assistance of explaining a concept of contiguous reproduction of non-contiguously arranged 

data block, 

Fig. 35 is a graph in assistance of explaining a concept of interleaving process enabling seamless reproduction of 
30 scenes, 

Fig. 36 is a graph also in assistance of explaining a concept of interleaving process enabling seamless reproduction 
of scenes, 

Fig. 37 is a graph schematically showing an actual arrangement of data blocks recorded to a data recording track 
on a recording medium according to the present invention, 
35 Fig. 38 is a graph schematically showing contiguous block regions and interleaved block regions array, 

Fig. 39 is a graph schematically showing a content of a VTS title VOBS according to the present invention, 

Fig. 40 is a graph schematically showing an internal data structure of the interleaved block regions according to the 

present invention, 

Fig. 41 is a graph in assistance of explaining a concept of defining the minimum number of divisions in VOB for 
40 interleaving, 

Fig. 42 is a graph schematically showing the data structure of the interleaved unit, 

Fig. 43 is a graph in assistance of explaining a concept of parental control according to the present invention, 
Fig. 44 is a graph in assistance of explaining a concept of multi-angle scene control according to the present in 
invention, 

45 Fig. 45 is a graph in assistance of explaining the multi-angle scene control when two mufti-angle data having differ- 
ent audio data, 

Fig. 46 is a graph schematically showing plural angle data written to multi-angle scene period according to the 
present invention, 

Fig. 47 is a graph showing the stream containing a multi-angle scene period and the basics of the on-disk layout 
so according to the present invention, 

Fig. 48 is a graph showing an example in which the last pack address in the current A-ILVU and the address of the 
next A-ILVU are recorded for plural angles according to the present invention, 

Figs. 49 and 50 are graphs in assistance of explaining the method of managing addresses on the switching unit 
level between multiple angle data in multi-angle scene periods according to the present invention, 
55 Fig. 51 is a flow chart, formed by Figs. 51 A and 51 B, showing an operation of the DVD encoder of Fig. 25. 
Fig. 52 is a flow chart showing detailed of the encode parameter production sub-routine of Fig. 51, 
Fig. 53 is a flow chart showing the detailed of the VOB data setting routine of Fig. 52, 
Fig. 54 is a flow chart showing the encode parameters generating operation for a seamless switching, 



4 



EP 0 847 200 A1 

Fig 55 is a flow chart showing the encode parameters generating operation for a system stream. 

Fig % is a flow chart showing details of the multi-angle seamless ^swrtch-ng control rout,ne of Rg. 56. 
Fig'. 59 is a flow chart showing details of the parental lock sub-routne _c Rg. 56. 

S"« and 63 are graphs showing decoding infection table produced by the decoding system com^^ 

W F?a 64 is a flow chart showing details of reproduction extracted PGC routing of Ra 69 

S Is a Z chart showing details of non-seamless * rOCeSS ° f 7 °' 

15 is a flow chart showing details of the decoder synch ™^P"~»£ 6? ' 

scene period, ^ . hrtH frtr orhievina the data structure shown in Fig. 46, 

data is written to each different angle according to the present invents. 
Best Mode for Carrying Out the Invention 

The prevent invention is detailedly described with reference to the accompanying drawings. 
p nt o ctriirtiim nf the aythnrinQ system 

ionl to the content of an entire movie, orto iuet one scene «m» a tmee. More 

video title sets. More specifically, each vdeo zone ^ '.near sequence v 0 ^ set ^ w> is used 

#K where K is an integer va.ue of zero or ^ at ^^ video title set. 

system of the present invention, and comprises plural video zones VZ. 
A.^hnring *nr.oder EC 

A peeked en^imen, ot the .noting encode, EC ««»*, W 



20 



25 



30 



35 



40 



45 



50 



55 



5 



10 



15 



20 



25 



30 



EP 0 847 200 A1 

, rom a t«eo t^ or ome, recording ■~*^*" ~£ these three source signals, a* is to conven- 

While the multimedia source .Mm ■ stomm Ft»2as oompns ^g ^ (ypes c| 

rrm£L*^^^ 

T^^XTjTJZr^^^ -0. reefer ,*». and reoo,om 9 median, 
M T*. ufdeo zona «. cc^es — c*C ,»B, .000. -ormasar 1 ,00. and ubfcmre and fee 

""STSSS: InSded 0, ». authoring encode, E C of fhe presen, ambodimm. is receded b, way of exauple 

onlvto an optical disk. ,, 1It _ lrt _ scenario data. i.e.. the user-defined editing 

The scenario editor 100 of the authoring encoder EC [^^J^ZSLi a bitstream MBS according to 
instructions. The scenario data controis editing ^^^^^eTgina. multimedia title. This scenario 
he user's manipulation of the video. «f^*SJS T£SSm5?1S source'stream buffer. The scenario «frtor 

f 00 Ts 1 ~n«an°= 
^uferisthusaUe*^^^^^^ 

and speaker to confirm the content of the JJ^SS^SiS npS deuces while confirming the content of the «.e 
desired scenario using the keyboard. T^f •^^^^ulation is the scenario data St7. 
on the display and speakers. The result of this ^Sna what source data is selected from all or a subset 

The scenario data St7 is basically a ^'^252^5^. selected source data is reassembled 
of the source data containing plural trt.es ^"^JJ^CSi the instructions received through the keyboard 

and SB a Known Urns Td and man «^ >*~™ » i^ncocu'ng proo^. Mora spacif ,0.11* whan data ancodmg 
This daisy is required fo, synchron,za»on wW me "JStSSu* la., whan aneoding inanediataly follows ad»,ng. 
and use, generation scenario data St7 a,. ^^T^^^ editing pwess based or. the seenano data 
M Td is raquired to determine the conttnl oMhe f^^^tce dMa ^ * dslayed by tuna Td to synchrc- 

""SSUt- M' — — *» drasooroaa.a.mbut.armay.har^ebadatarrninsdaoo.rdmg^^dalay 

tims Td raquired and the allowaMe m^ a *™ j^, „ the scenario editor 100 and receives the scenario data SO 
The encorf,ng systee, oontroWr 200 « a mnaaad segment contains) in the scenanodat. 

therefrom. Besed on the ttmubas. I^°" '™ L'^f. en^pammete, signals St9. Stff . and Stf3 lor encodmg 

STS; S^^"TS» * SETS- buna,, a. are synched fo 
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encoding parameter signals St9, St11, and St13. 

More specifically, encoding parameter signal St9 is the video encoding signal specifying the encoding timing of 
video stream St1 to extract the encoding segment from the video stream St1 and generate the video encoding unit. 
Encoding parameter signal St11 is likewise the sub-picture stream encoding signal used to generate the sub-picture 

5 encoding unit by specifying the encoding timing for sub-picture stream St3. Encoding parameter signal St1 3 is the audio 
encoding signal used to generate the audio encoding unit by specifying the encoding timing for audio stream St5. 

Based on the time-base relationship between the encoding segments of streams Stl . St3, and St5 in the multime- 
dia source data contained in scenario data St7, the encoding system controller 200 generates the timing signals St21 , 
St23, and St25 arranging the encoded multimedia-encoded stream in the specified time-base relationship. 

w The encoding system controller 200 also generates the reproduction time information IT defining the reproduction 
time of the title editing unit (video object, VOB), and the stream encoding data St33 defining the system encode param- 
eters for multiplexing the encoded multimedia stream containing video, audio, and sub-picture data. Note that the repro- 
duction time information IT and stream encoding data St33 are generated for the video object VOB of each title in one 
video zone VZ. 

15 The encoding system controller 200 also generates the title sequence control signal St39, which declares the for- 
matting parameters for formatting the title editing units VOB of each of the streams in a particular time-base relationship 
as a multimedia bitstream. More specifically, the title sequence control signal St39 is used to control the connections 
between the title editing units (VOB) of each title in the multimedia bitstream MBS, or to control the sequence of the 
interleaved title editing unit (VOBs) interleaving the title editing units VOB of plural reproduction paths. 

20 The video encoder 300 is connected to the source stream buffer of the scenario editor 1 00 and to the encoding sys- 
tem controller 200, and receives therefrom the video stream St1 and video encoding parameter signal St9, respectively. 
Encoding parameters supplied by the video encoding signal St9 include the encoding start and end timing, bit rate, the 
encoding conditions for the encoding start and end, and the material type. Possible material types include NTSC or PAL 
video signal, and telecine converted material. Based on the video encoding parameter signal St9, the video encoder 

25 300 encodes a specific part of the video stream St1 to generate the encoded video stream St1 5. 

The sub-picture encoder 500 is similarly connected to the source stream buffer of the scenario editor 100 and to 
the encoding system controller 200, and receives therefrom the sub-picture stream St3 and sub-picture encoding 
parameter signal St1 1 , respectively. Based on the sub-picture encoding parameter signal St1 1 , the sub-picture encoder 
500 encodes a specific part of the sub-picture stream St3 to generate the encoded sub-picture stream St 17. 

30 The audio encoder 700 is also connected to the source stream buffer of the scenario editor 1 00 and to the encoding 
system controller 200, and receives therefrom the audio stream St5 and audio encoding parameter signal St13, which 
supplies the encoding start and end timing. Based on the audio encoding parameter signal St13, the audio encoder 700 
encodes a specific part of the audio stream St5 to generate the encoded audio stream St19. 

The video stream buffer 400 is connected to the video encoder 300 and to the encoding system controller 200. The 

35 video stream buffer 400 stores the encoded video stream St 1 5 input from the video encoder 300, and outputs the stored 
encoded video stream St15 as the time-delayed encoded video stream St27 based on the timing signal St21 supplied 
from the encoding system controller 200. 

The sub-picture stream buffer 600 is similarly connected to the sub-picture encoder 500 and to the encoding sys- 
tem controller 200. The sub-picture stream buffer 600 stores the encoded sub-picture stream St1 7 output from the sub- 

40 picture encoder 500. and then outputs the stored encoded sub-picture stream St17 as time-delayed encoded sub-pic- 
ture stream St29 based on the timing signal St23 supplied from the encoding system controller 200. 

The audio stream buffer 800 is similarly connected to the audio encoder 700 and to the encoding system controller 
200. The audio stream buffer 800 stores the encoded audio stream St19 input from the audio encoder 700, and then 
outputs the encoded audio stream St19 as the time<lelayed encoded audio stream St31 based on the timing signal 

45 St25 supplied from the encoding system controller 200. 

The system encoder 900 is connected to the video stream buffer 400, sub-picture stream buffer 600. audio stream 
buffer 800, and the encoding system controller 200. and is respectively supplied thereby with the time-delayed encoded 
video stream St27, time-delayed encoded sub-picture stream St29, time-delayed encoded audio stream St31, and the 
stream encoding data St33. Note that the system encoder 900 is a multiplexer that multiplexes the time-delayed 

so streams St27. St29. and St31 based on the stream encoding data St33 (timing signal) to generate title editing unit 
(VOB) St35. The stream encoding data St33 contains the system encoding parameters, including the encoding start 
and end timing. 

The video zone formatter 1300 is connected to the system encoder 900 and the encoding system controller 200 
from which the title editing unit (VOB) St35 and title sequence control signal St39 (timing signal) are respectively sup- 
55 plied. The title sequence control signal St39 contains the formatting start and end timing, and the formatting parameters 
used to generate (format) a multimedia bitstream MBS. The video zone formatter 1300 rearranges the title editing units 
(VOB) St35 in one video zone VZ in the scenario sequence defined by the user based on the title sequence control sig- 
nal St39 to generate the edited multimedia stream data St43. 



7 



EP 0 847 200 A1 



The multimedia bitstream MBS St43 edited according to the user-defined scenario is then sent to the recorder 
1200. The recorder 1200 processes the edited multimedia stream data St43 to the data stream St45 format of the 
recording medium M, and thus records the formatted data stream St45 to the recording medium M. Note that the mul- 
timedia bitstream MBS recorded to the recording medium M contains the volume file structure VFS, which includes the 
physical address of the data on the recording medium generated by the video zone formatter 1300. 

Note that the encoded multimedia bitstream MBS St35 may be output directly to the decoder to immediately repro- 
duce the edited title content. It will be obvious that the output multimedia bitstream MBS will not in this case contain the 
volume file structure VFS. 

Authoring de coder DC 

A preferred embodiment of the authoring decoder DC used to decode the multimedia bitstream MBS edited by the 
authoring encoder EC of the present invention, and thereby reproduce the content of each title unit according to the 
user-defined scenario, is described next below with reference to Fig. 3. Note that in the preferred embodiment 
described below the multimedia bitstream St45 encoded by the authoring encoder EC is recorded to the recording 
medium M. 

As shown in Fig. 3, the authoring decoder DC comprises a multimedia bitstream producer 2000. scenario selector 
21 00, decoding system controller 2300, stream buffer 2400, system decoder 2500, video buffer 2600, sub-picture buffer 
2700, audio buffer 2800, synchronizer 2900. video decoder 3800, sub-picture decoder 3100, audio decoder 3200, syn- 
thesizer 3500, video data output terminal 3600, and audio data output terminal 3700. 

The bitstream producer 2000 comprises a recording media drive unit 2004 for driving the recording medium M; a 
reading head 2006 for reading the information recorded to the recording medium M and producing the binary read sig- 
nal St57; a signal processor 2008 for variously processing the read signal St57 to generate the reproduced bitstream 
St61 ; and a reproduction controller 2002. 

The reproduction controller 2002 is connected to the decoding system controller 2300 from which the multimedia 
bitstream reproduction control signal St53 is supplied, and in turn generates the reproduction control signals St55 and 
St59 respectively controlling the recording media drive unit (motor) 2004 and signal processor 2008. 

So that the user-defined video, sub-picture, and audio portions of the multimedia title edited by the authoring 
encoder EC are reproduced, the authoring decoder DC comprises a scenario selector 2100 for selecting and reproduc- 
ing the corresponding scenes (titles). The scenario selector 2100 then outputs the selected titles as scenario data to 
the authoring decoder DC. 

The scenario selector 21 00 preferably comprises a keyboard, CPU, and monitor. Using the keyboard, the user then 
inputs the desired scenario based on the content of the scenario input by the authoring encoder EC. Based on the key- 
board input, the CPU generates the scenario selection data St51 specifying the selected scenario. The scenario selec- 
tor 2100 is connected by an infrared communications device, for example, to the decoding system controller 2300, to 
which it inputs the scenario selection data St51 . 

Based on the scenario selection data St51 , the decoding system controller 2300 then generates the bitstream 
reproduction control signal St53 controlling the operation of the bitstream producer 2000. 

The stream buffer 2400 has a specific buffer capacity used to temporarily store the reproduced bitstream St61 input 
from the bitstream producer 2000, extract the address information and initial synchronization data SCR (system clock 
reference) for each stream, and generate bitstream control data St63. The stream buffer 2400 is also connected to the 
decoding system controller 2300, to which it supplies the generated bitstream control data St63. 

The synchronizer 2900 is connected to the decoding system controller 2300 from which it receives the system clock 
reference SCR contained in the synchronization control data St81 to set the internal system clock STC and supply the 
reset system clock St79 to the decoding system controller 2300. 

Based on this system clock St79, the decoding system controller 2300 also generates the stream read signal St65 
at a specific interval and outputs the read signal St65 to the stream buffer 2400. 

Based on the supplied read signal St65. the stream buffer 2400 outputs the reproduced bitstream St61 at a specific 
interval to the system decoder 2500 as bitstream St67. 

Based on the scenario selection data St51. the decoding system controller 2300 generates the decoding signal 
St69 defining the stream Ids for the video, sub-picture, and audio bitstreams corresponding to the selected scenario, 
and outputs to the system decoder 2500. 

Based on the instructions contained in the decoding signal St69, the system decoder 2500 respectively outputs the 
video, sub-picture, and audio bitstreams input from the stream buffer 2400 to the video buffer 2600, sub-picture buffer 
2700, and audio buffer 2800 as the encoded video stream St71 , encoded sub-picture stream St73. and encoded audio 
stream St75. 

The system decoder 2500 detects the presentation time stamp PTS and decoding time stamp DTS of the smallest 
control unit in each bitstream St67 to generate the time information signal St77. This time information signal St77 is sup- 



8 



EP0 847 200 A1 



plied to the synchronizer 2900 through the decoding system controller 2300 as the synchronization control data St81 . 

Based on this synchronization control data St81, the synchronizer 2900 determines the decoding start timing 
whereby each of the bitstreams will be arranged in the correct sequence after decoding, and then generates and inputs 
the video stream decoding start signal St89 to the video decoder 3800 based on this decoding timing. The synchronizer 
2900 also generates and supplies the sub-picture decoding start signal St91 and audio stream decoding start signal 
St93 to the sub-picture decoder 3100 and audio decoder 3200, respectively. 

The video decoder 3800 generates the video output request signal St84 based on the video stream decoding start 
signal St89. and outputs to the video buffer 2600. In response to the video output request signal St84, the video buffer 
2600 outputs the video stream St83 to the video decoder 3800. The video decoder 3800 thus detects the presentation 
time information contained in the video stream St83, and disables the video output request signal St84 when the length 
of the received video stream St83 is equivalent to the specified presentation time. A video stream equal in length to the 
specified presentation time is thus decoded by the video decoder 3800, which outputs the reproduced video signal 
St104 to the synthesizer 3500. 

The sub-picture decoder 3100 similarly generates the sub-picture output request signal St86 based on the sub-pic- 
ture decoding start signal St91, and outputs to the sub-picture buffer 2700. In response to the sub-picture output 
request signal St86, the sub-picture buffer 2700 outputs the sub-picture stream St85 to the sub-picture decoder 3100. 
Based on the presentation time information contained in the sub-picture stream St85 f the sub-picture decoder 3100 
decodes a length of the sub-picture stream St85 corresponding to the specified presentation time to reproduce and 
supply to the synthesizer 3500 the sub-picture signal St99. 

The synthesizer 3500 superimposes the video signal St104 and sub-picture signal St99 to generate and output the 
multi-picture video signal St105 to the video data output terminal 3600. 

The audio decoder 3200 generates and supplies to the audio buffer 2800 the audio output request signal St88 
based on the audio stream decoding start signal St93. The audio buffer 2800 thus outputs the audio stream St87 to the 
audio decoder 3200. The audio decoder 3200 decodes a length of the audio stream St87 corresponding to the specified 
presentation time based on the presentation time information contained in the audio stream St87, and outputs the 
decoded audio stream St101 to the audio data output terminal 3700. 

It is thus possible to reproduce a user-defined multimedia bitstream MBS in real-time according to a user-defined 
scenario. More specifically, each time the user selects a different scenario, the authoring decoder DC is able to repro- 
duce the title content desired by the user in the desired sequence by reproducing the multimedia bitstream MBS corre- 
sponding to the selected scenario. 

It is therefore possible by means of the authoring system of the present invention to generate a multimedia bit- 
stream according to plural user-defined scenarios by real-time or batch encoding multimedia source data in a manner 
whereby the substreams of the smallest editing units (scenes), which can be divided into plural substreams, expressing 
the basic title content are arranged in a specific time-base relationship. 

The multimedia bitstream thus encoded can then be reproduced according to the one scenario selected from 
among plural possible scenarios. It is also possible to change scenarios while playback is in progress, i.e., to select a 
different scenario and dynamically generate a new multimedia bitstream according to the most recently selected sce- 
nario. It is also possible to dynamically select and reproduce any of plural scenes while reproducing the title content 
according to a desired scenario. 

It is therefore possible by means of the authoring system of the present invention to encode and not only reproduce 
but to repeatedly reproduce a multimedia bitstream MBS in real-time. 

A detail of the authoring system is disclosed Japanese Patent Application filed September 27, 1996, and entitled 
and assigned to the same assignee as the present application. 

DVD 

An example of a digital video disk (DVD) with only one recording surface (a single-sided DVD) is shown in Fig. 4. 

The DVD recording medium RC1 in the preferred embodiment of the invention comprises a data recording surface 
RS1 to and from which data is written and read by emitting laser beam LS. and a protective layer PL1 covering the data 
recording surface RS1 . A backing layer BL1 is also provided on the back of data recording surface RS1 The side of the 
disk on which protective layer PL1 is provided is therefore referred to below as side SA (commonly "side A"), and the 
opposite side (on which the backing layer BL1 is provided) is referred to as side SB ("side B"). Note that digital video 
disk recording media having a single data recording surface RS1 on only one side such as this DVD recording medium 
RC1 is commonly called a single-sided single layer disk. 

A detailed illustration of area C1 in Fig. 4 is shown in Fig. 5. Note that the data recording surface RS1 is formed by 
applying a metallic thin film or other reflective coating as a data layer 4109 on a first transparent layer 4108 having a 
particular thickness T1 . This first transparent layer 41 08 also functions as the protective layer PL1 . A second transpar- 
ent substrate 4111 of a thickness T2 functions as the backing layer BL1, and is bonded to the first transparent layer 
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4108 by means of an adhesive layer 41 10 disposed therebetween. 

A printing layer 4112 for printing a disk label may also be disposed on the second transparent substrate 41 1 1 as 
necessary. The printing layer 41 12 does not usually cover the entire surface area of the second transparent substrate 
41 1 1 (backing layer BL1 ), but only the area needed to print the text and graphics of the disk label. The area of second 

5 transparent substrate 41 1 1 to which the printing layer 41 12 is not formed may be left exposed. Light reflected from the 
data layer 4109 (metallic thin film) forming the data recording surface RS1 can therefore be directly observed where the 
label is not printed when the digital video disk is viewed from side SB. As a result, the background looks like a silver- 
white over which the printed text and graphics float when the metallic thin film is an aluminum thin film, for example. 
Note that it is only necessary to provide the printing layer 41 12 where needed for printing, and it is not necessary 

io to provide the printing layer 4112 over the entire surface of the backing layer BL1 . 

A detailed illustration of area C2 in Fig. 5 is shown in Fig. 6. Pits and lands are molded to the common contact sur- 
face between the first transparent layer 4108 and the data layer 4109 on side SA from which data is read by emitting a 
laser beam LS, and data is recorded by varying the lengths of the pits and lands (i.e., the length of the intervals between 
the pits). More specifically, the pit and land configuration formed on the first transparent layer 4108 is transferred to the 

is data layer 4109. The lengths of the pits and lands are shorter, and the pitch of the data tracks formed by the pit 
sequences is narrower, than with a conventional Compact Disc (CD). The surface recording density is therefore greatly 
improved. 

Side SA of the first transparent layer 4108 on which data pits are not formed is a flat surface. The second transpar- 
ent substrate 41 1 1 is for reinforcement, and is a transparent panel made from the same material as the first transparent 
20 layer 41 08 with both sides flat. Thickness T1 and T2 are preferably equal and commonly approximately 0.6 mm, but the 
invention shall not be so limited. 

AS with a CD, information is read by irradiating the surface with a laser beam LS and detecting the change in the 
reflectivity of the light spot. Because the objective lens aperture N A can be large and the wavelength I of the light beam 
small in a digital video disk system, the diameter of the light spot Ls used can be reduced to approximately 1/1 .6 the 
25 light spot needed to read a CD. Note that this means the resolution of the laser beam LS in the DVD system is approx- 
imately 1 .6 times the resolution of a conventional CD system. 

The optical system used to read data from the digital video disk uses a short 650 nm wavelength red semiconductor 
laser and an objective lens with a 0.6 mm aperture NA. By thus also reducing the thickness T of the transparent panels 
to 0.6 mm, more than 5 GB of data can be stored to one side of a 120 mm diameter optical disk. 
30 It is therefore possible to store motion picture (video) images having an extremely large per unit data size to a digital 
video disk system disk without losing image quality because the storage capacity of a single-sided, single-layer record- 
ing medium RC1 with one data recording surface RS1 as thus described is nearly ten times the storage capacity of a 
conventional CD. As a result, while the video presentation time of a conventional CD system is approximately 74 min- 
utes if image quality is sacrificed, high quality video images with a video presentation time exceeding two hours can be 
35 recorded to a DVD. 

The digital video disk is therefore well-suited as a recording medium for video images. 

A digital video disk recording medium with plural recording surfaces RS as described above is shown in Figs. 7 and 
8. The DVD recording medium RC2 shown in Fig. 7 comprises two recording surfaces, i.e.. first recording surface RS1 
and semi-transparent second recording surface RS2, on the same side. i.e. side SA, of the disk. Data can be simulta- 

40 neously recorded or reproduced from these two recording surfaces by using different laser beams LS1 and LS2 for the 
first recording surface RS1 and the second recording surface RS2. It is also possible to read/write both recording sur- 
faces RS1 and RS2 using only one of the laser beams LS1 or LS2. Note that recording media thus comprised are called 
"single-side, dual-layer disks." 

It should also be noted that while two recording surfaces RS1 and RS2 are provided in this example, it is also pos- 

45 sible to produce digital video disk recording media having more than two recording surfaces RS. Disks thus comprised 
are known as "single-sided, multi-layer disks." 

Though comprising two recording surfaces similarly to the recording media shown in Fig. 7, the DVD recording 
medium RC3 shown in Fig. 8 has the recording surfaces on opposite sides of the disk, i. e.. has the first data recording 
surface RSI on side SA and the second data recording surface RS2 on side SB. It will also be obvious that while only 

so two recording surfaces are shown on one digital video disk in this example, more than two recording surfaces may also 
be formed on a double-sided digital video disk. As with the recording medium shown in Fig. 7, it is also possible to pro- 
vide two separate laser beams LS1 and LS2 for recording surfaces RS1 and RS2, or to read/write both recording sur- 
faces RS1 and RS2 using a single laser beam. Note that this type of digital video disk is called a "double-sided, dual- 
layer disk." It will also be obvious that a double-sided digital video disk can be comprised with two or more recording 

55 surfaces per side. This type of disk is called a "double-sided, multi-layer disk." 

A plan view from the laser beam LS irradiation side of the recording surface RS of the DVD recording medium RC 
is shown in Fig. 9 and Fig. 10. Note that a continuous spiral data recording track TR is provided from the inside circum- 
ference to the outside circumference of the DVD. The data recording track TR is divided into plural sectors each having 
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the same known storage capacity. Note that for simplicity only the data recording track TR is shown in Fig. 9 with more 
than three sectors per revolution. 

As shown in Fig. 9, the data recording track TR is normally formed clockwise inside to outside (see arrow DrA) from 
the inside end point IA at the inside circumference of disk RCA to the outside end point OA at the outside circumference 

5 of the disk with the disk RCA rotating counterclockwise RdA. This type of disk RCA is called a clockwise disk, and the 
recording track formed thereon is called a clockwise track TRA. 

Depending upon the application, the recording track TRB may be formed clockwise from outside to inside circum- 
ference (see arrow DrB in Fig. 10) from the outside end point OB at the outside circumference of disk RCB to the inside 
end point IB at the inside circumference of the disk with the disk RCB rotating clockwise RdB. Because the recording 

w track appears to wind counterclockwise when viewed from the inside circumference to the outside circumference on 
disks with the recording track formed in the direction of arrow DrB, these disks are referred to as counterclockwise disk 
RCB with counterclockwise track TRB to distinguish them from disk RCA in Fig. 9. Note that track directions DrA and 
DrB are the track paths along which the laser beam travels when scanning the tracks for recording and playback. Direc- 
tion of disk rotation RdA in which disk RCA turns is thus opposite the direction of track path DrA, and direction of disk 

75 rotation RdB in which disk RCB turns is thus opposite the direction of track path DrB. 

An exploded view of the single-sided, dual-layer disk RC2 shown in Fig. 7 is shown as disk RC2o in Fig. 11. Note 
that the recording tracks formed on the two recording surfaces run in opposite directions. Specifically, a clockwise 
recording track TRA as shown in Fig. 9 is formed in clockwise direction DrA on the (lower) first data recording surface 
RS1 , and a counterclockwise recording track TRB formed in counterclockwise direction DrB as shown in Fig. 10 is pro- 

20 vided on the (upper) second data recording surface RS2. As a result, the outside end points OA and OB of the first and 
second (top and bottom) tracks are at the same radial position relative to the center axis of the disk RC2o. Note that 
track paths DrA and DrB of tracks TR are also the data read/write directions to disk RC. The first and second (top and 
bottom) recording tracks thus wind opposite each other with this disk RC, i.e., the track paths DrA and DrB of the top 
and bottom recording layers are opposite track paths. 

25 Opposite track path type, single-sided, dual-layer disks RC2o rotate in direction RdA corresponding to the first 
recording surface RS1 with the laser beam LS traveling along track path DrA to trace the recording track on the first 
recording surface RS1. When the laser beam LS reaches the outside end point OA, the laser beam LS can be refo- 
cused to end point OB on the second recording surface RS2 to continue tracing the recording track from the first to the 
second recording surface uninterrupted. The physical distance between the recording tracks TRA and TRB on the first 

30 and second recording surfaces RS1 and RS2 can thus be instantaneously eliminated by simply adjusting the focus of 
the laser beam LS. 

It is therefore possible with an opposite track path type, single-sided, dual-layer disk RC2o to easily process the 
recording tracks disposed to physically discrete top and bottom recording surfaces as a single continuous recording 
track. It is therefore also possible in an authoring system as described above with reference to Fig. 1 to continuously 

35 record the multimedia bitstream MBS that is the largest multimedia data management unit to two discrete recording sur- 
faces RS1 and RS2 on a single recording medium RC2o. 

It should be noted that the tracks on recording surfaces RS1 and RS2 can be wound in the directions opposite 
those described above, i.e., the counterclockwise track TRB may be provided on the first recording surface RS1 and the 
clockwise track TRA on the second recording surface RS2. In this case the direction of disk rotation is also changed to 

40 a clockwise rotation RdB, thereby enabling the two recording surfaces to be used as comprising a single continuous 
recording track as described above. For simplification, a further example of this type of disk is therefore neither shown 
nor described below. 

It is therefore possible by thus constructing the digital video disk to record the multimedia bitstream MBS for a fea- 
ture-length title to a single opposite track path type, single-sided, dual-layer disk RC2o. Note that this type of digital 

45 video disk medium is called a single-sided dual-layer disk with opposite track paths. 

Another example of the single-sided, dual-layer DVD recording medium RC2 shown in Fig. 7 is shown as disk RC2p 
in Fig. 12. The recording tracks formed on both first and second recording surfaces RS1 and RS2 are clockwise tracks 
TRA as shown in Fig. 9. In this case, the single-sided, dual-layer disk RC2p rotates counterclockwise in the direction of 
arrow RdA, and the direction of laser beam LS travel is the same as the direction of the track spiral, i.e., the track paths 

so of the top and bottom recording surfaces are mutually parallel (parallel track paths). The outside end points OA of both 
top and bottom tracks are again preferably positioned at the same radial position relative to the center axis of the disk 
RC2p as described above. As also described above with disk RC2o shown in Fig. 1 1 , the access point can be instan- 
taneously shifted from outside end point OA of track TRA on the first recording surface RS1 to the outside end point OA 
of track TRA on the second recording surface RS2 by appropriately adjusting the focus of the laser beam LS at outside 

55 end point OA. 

However, for the laser beam LS to continuously access the clockwise recording track TRA on the second recording 
surface RS2, the recording medium RC2p must be driven in the opposite direction (clockwise, opposite direction RdA). 
Depending on the radial position of the laser beam LS. however, it is inefficient to change the rotational direction of the 
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recording medium. As shown by the diagonal arrow in Fig. 12. the laser beam LS is therefore moved from the oute.de 
end S oTof the track on the first receding surface RSI to the inside end point .A of the track on the second record- 
ing surface RS2 to use these physically discrete recording tracks as one logically contmuous recording track. 

Rather than using the recording tracks on top and bottom recording surfaces as one continuous recording track, it 
is al^sslbTe to use the recording tracks to record the multimedia bitstreams MBS for different ft.es. This type of d.g- 
ital video disk recording medium is called a "single-sided, dual-layer disk with parallel track paths. 

Note mat direction of the tracks formed on the recording surfaces RSI and RS2 is oppose fttt described 
abovVi e counterclockwise recording tracks TRB are formed, disk operation remains the same as that described 
ahove' except for the direction of disk rotation, which is clockwise as shown by arrow RdB. 

Xh2 us ng clockwise or counterclockwise recording tracks, the single-sided, dual-layer d.sk £C2p 
trackTaVri thus described is well-suited to storing on a single disk encyclopedia and s,m.lar multimedia brtstreams 
comorisina multiple titles that are frequently and randomly accessed. 

T Tested view of the dual-sided single-layer DVD recording medium RC3 comprising one record.ng surface 
.ayer^sTaS RS^Tn each side as shown ?n Fig. 8 is shown as DVD recording medium RC3s in Fig. 
reird^ng track TR A is provided on the one recording surface RS1 . and a counterc.ockw.se recording track : TRB m pro- 
X on the other recording surface RS2. As in the preceding recording media, the outs.de end points OA and OB of 
^eSrSngTacks on each recording surface are preferably positioned at the same rad.a. position relative to the 

roni&r axis of the DVD recording medium RC3s. . L . ^ Al 

Note *\a while the recording tracks on these recording surfaces RS1 and RS2 rotate .n opposrte d-rect.ons.ttie 
track oatt,s are symmetrical. This type of recording medium is therefore known as a doub.e-s.ded dual layer disk wrth 
^SSSSSSSi This double-sided dual .ayer disk with symmetrica, track paths RC3s rotates in director, RdA 
SreaSgSSg the first recording surface RS1. As a result the track path on the second recording: surface RS2 
on the o^srtTsWe is opposite the direction DrB in which the track winds, i.e.. direction DrA. Access.ng both recording 
surfaced *ar^ msVSnJa single laser beam LS is therefore not realistic irrespective of whether access is contm- 
uous o^ addition, a muttimedia brtstream MBS is separate* recorded to the recording surfaces on 

th6 T^reT^^Tf^Ls^ sing.e .ayer disk RC3 shown in Rg. 8 is shown in Fig. 14 as disk RC3a 
Note thauhrdrsk comprises clockwise recording tracks TRA as shown in Fig. 9 on both recordingsurfaces RSI land 
SS2 A^vSh the PrecSng recording media, the outside end points OA and OA of the recording tracks on each record- 
rnfsurSc^are pre eraWy positioned at the same radial position relative to the center axis of the DVD recording medium 
RC3a*nHte^S ^SJXd dua. layer disk with symmetrical track paths RC3s described above the tracks .on these 
retina surfaces RS1 and RS2 are asymmetrical. This type of disk is therefore known as a double-sided dual layer 
Z^£Z£ £* paths. This doub.e-sided dua. .ayer disk with asymmetrica. track paths RC3a rotates in 
2 rectTon K^en Sading/wrrting the first recording surface RSI . As a result, the track path , onthe second record.ng 
suSe RS2 on ttte opposite is opposite the direction DrA in which the track winds, i.e.. direction DrB. 

Thi ^^LnsThatTa faser beam Sis driven continuously from the inside circumference to the outs.de arcumfer- 
ence^n the^irst re^rding surface RS1, and then from the outside circumference to the .ns.de c^cunjerenco on tt,e 
seSnd ^co-Sg Sace RS2. both sides of the recording medium RC3a can be reaoVwntten without turning the disk 
over and without providing different laser beams for the two sides. ^ ^ ^ 

trac^ oatos tor retorting surface RSI ahd RS2 ar. also toe same wnh toisdooble-rMded dual layard.sk ««. 

! , n ™»» ^C3a As a result it is also possible to read/write both sides of the disk without provri.ng sap- 
SSSiSSffSlSS. SZZSSZ SETro. is toroed over behveeb sidas. ahd toe readme appa- 

— ."ST ZSSZSgSS. aihs Iderrtca I ^ » — *• 

track TRB is provided in place of clockwise recording track TRA on both record.ng surfaces RS1 and RSJL 

As describes aSve the true value of a DVD system whereby the storage capacity of the recording medium can be 

m^SSm^SSi* ™"«P ,e recordin 9 surtace iS reali26d in mU,timedia ^™ ^SlTl^ 
Z JSHSSL audio data units and plural graphics data units recorded to a single d.sk are reproduced through ,nter - 

aCBV .t SS^JSSS> achieve one .ong-standing desire of software (programming) providers, m^^o- 
vide P^Sng^ntent such as a commercia. movie on a sing.e recording medium in p.ura. versions for different lan- 
guage and demographic groups while retaining the image quality of the original. 

Parental control 

Content providers of movie and video titles have conventionally had to produce, supply z^™™^™*!* 
of inSS mZ in muttiple .anguages. typically the .anguage of each distribution market ^^^E^JT 
ages conforming to the parental control (censorship) regulations of individual countries in Europe and North America. 



12 



10 



15 



20 



EP 0 847 200 A1 

The time and resources required for this are significant. While high image quality is obviously important, the program- 
ming content must also be consistently reproducible. 

The digital video disk recording medium is close to solving these problems. 

5 Multiple angles 

One interactive operation widely sought in multimedia applications today is for the user to be able to change the 
position from which a scene is viewed during reproduction of that scene. This capability is achieved by means of the 

multiple angle function. 

This multiple angle function makes possible applications whereby, for example, a user can watch a baseball game 
from different angles (or virtual positions in the stadium), and can freely switch between the views while viewing is in 
progress In this example of a baseball game, the available angles may include a position behind the backstop centered 
on the catcher batter, and pitcher; one from behind the backstop centered on a fielder, the pitcher, and the catcher; and 
one from center field showing the view to the pitcher and catcher. To meet these requirements, the digital video disk 
system uses MPEG, the same basic standard format used with Video-CDs to record the video, audio, graphics, and 
other signal data Because of the differences in storage capacity, transfer rates, and signal processing performance 
within the reproduction apparatus, DVD uses MPEG2, the compression method and data format of which differ slightly 
from the MPEG1 format used with Video-CDs. 

It should be noted that the content of and differences between the MPEG1 and MPEG2 standards have no direct 
relationship to the intent of the present invention, and further description is therefore omitted below (tor more informa- 
tion see MPEG specifications ISO-1 1 172 and ISO-13818). 

The data structure of the DVD system according to the present invention is described in detail below with reference 
to Figs. 16. 17. 18, 19, 20, and 21. 

25 Multi-scene control 

A fully functional and practical parental lock playback function and multi-angle scene playback function must enable 
the user to modify the system output in minor, subtle ways while still presenting substantially the same video and audio 
oulput If these functions are achieved by preparing and recording separate titles satisfying each of the many possible 
parental lock and multi-angle scene playback requests, titles that are substantially identical and differ in only minor ways 
must be recorded to the recording medium. This results in identical data being repeatedly recorded to the larger part of 
the recording medium, and significantly reduces the utilization efficiency of the available storage capacity. More partic- 
ularly it is virtually impossible to record discrete titles satisfying every possible request even using the massive capacity 
of the digital video disk medium. While it may be concluded that this problem can be easily solved by increasing the 
capacity of the recording medium, this is an obviously undesirable solution when the effective use of available system 
resources is considered. 

Using multi-scene control, the concept of which is described in another section below, in a DVD system, it is possi- 
ble to dynamically construct titles for numerous variations of the same basic content using the smallest possible amount 
of data and thereby effectively utilize the available system resources (recording medium). More specifically, titles that 
can be played back with numerous variations are constructed from basic (common) scene periods containing data com- 
mon to each title and multi-scene periods comprising groups of different scenes corresponding to the various requests. 
During reproduction, the user is able to freely and at any time select particular scenes from the mult-scene periods to 
dynamically construct a title conforming to the desired content, e g., a title omitting certain scenes using the parental 

lock control function. . , . . . 

Note that multi-scene control enabling a parental lock playback control function and multi-angle scene playback is 
described in another section below with reference to Fig. 21 . 

Data structure of the DVD system 

The data structure used in the authoring system of a digital video disk system according to the present invention is 
shown in Fig 22 To record a multimedia bitstream MBS. this digital video disk system divides the recording medium 
into three major recording areas, the lead-in area LI. the volume space VS. and the lead-out area LO. 

The lead-in area LI is provided at the inside circumference area of the optical disk. In the disks descnbed with ref- 
erence to Figs. 9 and 10. the lead-in area LI is positioned at the inside end points IA and IB of each track. Data for sta- 
bilizing the operation of the reproducing apparatus when reading starts is written to the lead-in area LI. 

The lead-out area LO is correspondingly located at the outside circumference of the optical disk. i.e. . at outside end 
points OA and OB of each track in the disks described with reference to Figs. 9 and 10. Data identifying the end of the 
volume space VS is recorded in this lead-out area LO. 
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The volume space VS is located between the lead-in area LI and lead-out area LO, and is recorded as a one- 
dimensional array of n+1 (where n is an integer greater than or equal to zero) 2048-byte logic sectors LS. The logic sec- 
tors LS are sequentially number #0, #1 , #2, ... #n. The volume space VS is also divided into a volume and file structure 
management area VFS and a file data structure area FDS. 
5 The volume and file structure management area VFS comprises m+1 logic sectors LS#0 to LS#m (where m is an 

integer greater than or equal to zero and less than n. The file data structure FDS comprises n-m logic sectors LS #m+1 
to LS #n. 

Note that this file data structure area FDS corresponds to the multimedia bitstream MBS shown in Fig. 1 and 
described above. 

io The volume file structure VFS is the file system for managing the data stored to the volume space VS as files, and 
is divided into logic sectors LS#0 - LS#m where m is the number of sectors required to store all data needed to manage 
the entire disk, and is a natural number less than n. Information for the files stored to the file data structure area FDS is 
written to the volume file structure VFS according to a known specification such as ISO-9660 or ISO-13346. 

The file data structure area FDS comprises n-m logic sectors LS#m - LS#n, each comprising a video manager 
is VMG sized to an integer multiple of the logic sector (2048 x I, where I is a known integer), and k video title sets VTS #1 
- VTS#k (where k is a natural number less than 100). 

The video manager VMG stores the title management information for the entire disk, and information for building a 
volume menu used to set and change reproduction control of the entire volume. 

Any video title set VTS #k is also called a "video file" representing a title comprising video, audio, and/or still image 
20 data. 

The internal structure of each video title set VTS shown in Fig. 22 is shown in Fig. 16. Each video title set VTS com- 
prises VTS information VTSI describing the management information for the entire disk, and the VTS title video objects 
VOB (VTSTT_VOBS), i.e., the system stream of the multimedia bitstream. The VTS information VTSI is described first 
below, followed by the VTS title VOBS. 
25 The VTS information primarily includes the VTSI management table VTSLMAT and VTSPGC information table 
VTS_PGCIT. 

The VTSI management table VTSLMAT stores such information as the internal structure of the video title set VTS. 
the number of selectable audio streams contained in the video title set VTS, the number of sub-pictures, and the video 
title set VTS location (storage address). 

30 The VTSPGC information table VTS_PGCIT records i (where i is a natural number) program chain (PGC) data 
blocks VTS_PGCI #1 - VTS_PGCI #i for controlling the playback sequence. Each of the table entries VTS_PGC I #i is 
a data entry expressing the program chain, and comprises j (where j is a natural number) cell playback information 
blocks C_PBI #1 - C_PBI Each cell playback information block C_PBI #j contains the playback sequence of the cell 
and playback control information. 

35 The program chain PGC is a conceptual structure describing the story of the title content, and therefore defines the 
structure of each title by describing the cell playback sequence. Note that these cells are described in detail below. 

If, for example, the video title set information relates to the menus, the video title set information VTSI is stored to 
a buffer in the playback device when playback starts. If the user then presses a MENU button on a remote control 
device, for example, during playback, the playback device references the buffer to fetch the menu information and dis- 

40 play the top menu #1 . If the menus are hierarchical, the main menu stored as program chain information VTS_PGCI #1 
may be displayed, for example, by pressing the MENU button, VTS_PGCI #2 - #9 may correspond to submenus 
accessed using the numeric keypad on the remote control, and VTS_PGCI #10 and higher may correspond to addi- 
tional submenus further down the hierarchy. Alternatively, VTS_PGCI #1 may be the top menu displayed by pressing 
the MENU button, while VTS_PGCI #2 and higher may be voice guidance reproduced by pressing the corresponding 

is numeric key. 

The menus themselves are expressed by the plural program chains defined in this table. As a result, the menus 
may be freely constructed in various ways, and shall not be limited to hierarchical or non-hierarchical menus or menus 
containing voice guidance. 

In the case of a movie, for example, the video title set information VTSI is stored to a buffer in the playback device 
so when playback starts, the playback device references the cell playback sequence described by the program chain PGC, 
and reproduces the system stream. 

The "cells" referenced here may be all or part of the system stream, and are used as access points during playback. 
Cells can therefore be used, for example, as the "chapters" into which a title may be divided. 

Note that each of the PGC information entries C_PBI #j contain both cell playback processing information and a 
>5 cell information table. The cell playback processing information comprises the processing information needed to repro- 
duce the cell, such as the presentation time and number of repetitions. More specifically, this information includes the 
cell block mode CBM. cell block type CBT. seamless playback flag SPF. interleaved allocation flag IAF, STC resetting 
flag STCDF, cell presentation time C_PBTM, seamless angle change flag SACF, first cell VOBU start address 
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C_FVOBlLSA, and the last cell VOBU start address C_LVOBU_SA. 

Note that seamless playback refers to the reproduction in a digital video disk system of multimedia data including 
video, audio, and sub-picture data without intermittent breaks in the data or information. Seamless playback is 
described in detail in another section below with reference to Fig. 23 and Fig. 24. 

5 The cell block mode CBM indicates whether plural cells constitute one functional block. The cell playback informa- 

tion of each cell in a functional block is arranged consecutively in the PGC information. The cell block mode CBM of the 
first cell playback information in this sequence contains the value of the first cell in the block, and the cell block mode 
CBM of the last cell playback information in this sequence contains the value of the last cell in the block. The cell block 
mode CBM of each cell arrayed between these first and last cells contains a value indicating that the cell is a cell 

w between these first and last cells in that block. 

The cell block type CBT identifies the type of the block indicated by the cell block mode CBM. For example, when 
a multiple angle function is enabled, the cell information corresponding to each of the reproducible angles is pro- 
grammed as one of the functional blocks mentioned above, and the type of these functional blocks is defined by a value 
identifying "angle" in the cell block type CBT for each cell in that block. 

15 The seamless playback flag SPF simply indicates whether the corresponding cell is to be linked and played back 
seamlessly with the cell or cell block reproduced immediately therebefore. To seamlessly reproduce a given cell with 
the preceding cell or cell block, the seamless playback flag SPF is set to 1 in the cell playback information for that cell; 
otherwise SPF is set to 0. 

The interleaved allocation flag IAF stores a value identifying whether the cell exists in a contiguous or interleaved 
20 block. If the cell is part of an interleaved block, the flag IAF is set to 1 ; otherwise it is set to 0. 

The STC resetting flag STCDF identifies whether the system time clock STC used for synchronization must be 
reset when the cell is played back; when resetting the system time clock STC is necessary, the STC resetting flag 
STCDF is set to 1. 

The seamless angle change flag SACF stores a value indicating whether a cell in a multi-angle period should be 
25 connected seamlessly at an angle change. If the angle change is seamless, the seamless angle change flag SACF is 
set to 1 ; otherwise it is set to 0. 

The cell presentation time C_PBTM expresses the cell presentation time with video frame precision. 
The first cell VOBU start address C_FVOBU_SA is the VOBU start address of the first cell in a block, and is also 
expressed as the distance from the logic sector of the first cell in the VTS title VOBS (VTSTT_VOBS) as measured by 
30 the number of sectors. 

The last cell VOBU start address C_LVOBU_SA is the VOBU start address of the last cell in the block. The value 
of this address is expressed as the distance from the logic sector of the first cell in the VTS title VOBS (VTSTT_VOBS) 
as measured by the number of sectors. 

The VTS title VOBS (VTSTT_VOBS), i.e., the multimedia system stream data, is described next. The system 
35 stream data VTSTT_VOBS comprises i (where i is a natural number) system streams SS. each of which is referred to 
as a Video object" (VOB). Each video object VOB #1 - VOB #i comprises at least one video data block interleaved with 
up to a maximum eight audio data blocks and up to a maximum 32 sub-picture data blocks. 

Each video object VOB comprises q (where q is a natural number) cells C#1 - C#q. Each cell C comprises r (where 
r is a natural number) video object units VOBU #1 - VOBU #r. 
40 Each video object unit VOBU comprises plural group_of_pictures GOP, and the audio and sub-pictures corre- 
sponding to the playback of said plural group_of_pictures GOP. Note that the group_of_pictures GOP corresponds to 
the video encoding refresh cycle. Each video object unit VOBU also starts with an NV pack, i.e., the control data for that 
VOBU. 

The structure of the navigation packs NV is described with reference to Fig. 18. 

45 Before describing the navigation pack NV, the internal structure of the video zone VZ (see Rg. 22), i.e., the system 
stream St35 encoded by the authoring encoder EC described with reference to Fig. 25, is described with reference to 
Fig. 17. Note that the encoded video stream St15 shown in Fig. 17 is the compressed one-dimensional video data 
stream encoded by the video encoder 300. The encoded audio stream St19 is likewise the compressed one-dimen- 
sional audio data stream multiplexing the right and left stereo audio channels encoded by the audio encoder 700. Note 

so that the audio signal shall not be limited to a stereo signal, and may also be a multichannel surround-sound signal. 

The system stream (title editing unit VOB) St35 is a one dimensional array of packs with a byte size corresponding 
to the logic sectors LS #n having a 2048-byte capacity as described using Fig. 21. A stream control pack is placed at 
the beginning of the title editing unit (VOB) St35 f i.e., at the beginning of the video object unit VOBU. This stream control 
pack is called the "navigation pack N\T, and records the data arrangement in the system stream and other control infor- 

55 mation. 

The encoded video stream St15 and the encoded audio stream St19 are packetized in byte units corresponding to 
the system stream packs. These packets are shown in Fig. 1 7 as packets V1.V2.V3, V4... and A1 , A2, A3.... As shown 
in Fig. 17, these packets are interleaved in the appropriate sequence as system stream St35. thus forming a packet 
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V3. V4, A2. 



to 
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20 
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Note that the sequence shown in Fig. 17 interleaves one video data unit with one audio data unit. 

DVD system is shown in Fig. 18 and described below. example. 
a -n Fin 17 the DacKetized encoded video stream St15 is shown in Fig. 18 as V1. V2. V3, V4. .„ in mis 
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each video object VOB contains plural video object units VOBU. Data sets ot piurai viaeu uuj 
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picture data may be defined as one of up to 32 types *> «i_ specification is written if the stored 

Wrth constant bH rate coding. * e ^ h str ^ the video buffer, thereby making it pos- 

brt ~ and brt rate cod,n9 " 

GOP is not constant. The video and the video and audio data output from 

and audio data read from ^^^r^^^^SS^^ video and audio data is therefore descnbed ,n 
the decoder, do not match. The method of t«me-basej VJJJJ™" . constant brt rate coding, 
detail below with reference to Fig. 26. but is "^^^SSSSS^^ NV starts with a pack header PKH. and 

The navigation pack NV structure is shown in F.g. 20. Each navigauc- P 
contains a PCI packet and DSI packet is to be sent from stream buffer 2400 
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unit ILVU. Otherwise the Unit END flag is set to 0 ^ e |LVU to whicn the vOBU 

An interleaved Unit End Address (ILVU EA) «2^*SJf?S iSmmSlrit Start Address (NTJLVU.SA). 
belongs, and the starting address of the next inter eaved ^ btock. Both the Interleaved Unit End Address 

so are alio declared when a video f^^^^^S^ as the number of sectors from the 
(ILVU_EA) and Next Interleaved Unit Start Address iin _"-v <--_ i 

navigation pack NV of that VOBU. mnnecled M the audio components of the two system streams are not 

When two system streams are seamlessly ' ~" n ^« bur m a «^ ^ audio o^ut to synchronize 

contiguous, particularly immediately before and aft r the _ seam^rt JJ^TSw. « non-contiguous audio may result 
» the audio and video components of ^^^rreS With an NTSC signal, for example, the 
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(VOB_A_STP_PTM1), and Audio Stop PTM2 in VOB (VOB_A_STP_PTM2), indicating the time at which the audio is to 
be paused; and audio reproduction stopping periods 1 and 2, i.e., Audio Gap Length 1 in VOB (VOB_A_GAP_LEN1) 
and Audio Gap Length 2 in VOB (VOB_A_GAP_LEN2), indicating for how long the audio is to be paused, are also 
declared in the DSI packet. Note that these times are specified at the system clock precision (90 kHz). 

5 The Angle Information for Seamless playback (SML_AGLI) declares the read start address when the angle is 

changed. Note that this field is valid when seamless, multi-angle control is enabled. This address is also expressed as 
the number of sectors from the navigation pack NV of that VOBU. Because there are nine or fewer angles, there are 
nine angle address declaration cells: Destination Address of Angle Cell #1 for Seamless playback 
(SML_AGL_C1_DSTA) to Destination Address of Angle Cell #9 for Seamless playback (SML_AGL_C9_DSTA). 

w Note also that each title is edited in video object (VOB) units. Interleaved video objects (interleaved title editing 
units) are referenced as n VOBS"; and the encoded range of the source data is the encoding unit. 

DVD encoder 

15 A preferred embodiment of a digital video disk system authoring encoder ECD in which the multimedia bitstream 
authoring system according to the present invention is applied to a digital video disk system is described below and 
shown in Fig. 25. It will be obvious that the authoring encoder ECD applied to the digital video disk system, referred to 
below as a DVD encoder, is substantially identical to the authoring encoder EC shown in Fig. 2. The basic difference 
between these encoders is the replacement in the DVD encoder ECD of the video zone formatter 1300 of the authoring 

20 encoder EC above with a VOB buffer 1 000 and formatter 1 1 00. It will also be obvious that the bitstream encoded by this 
DVD encoder ECD is recorded to a digital video disk medium M. The operation of this DVD encoder ECD is therefore 
described below in comparison with the authoring encoder EC described above. 

As in the above authoring encoder EC, the encoding system controller 200 generates control signals St9. St11, 
St13. St21 , St23, St25, St33, and St39 based on the scenario data St7 describing the user-defined editing instructions 

25 input from the scenario editor 100, and controls the video encoder 300, sub-picture encoder 500. and audio encoder 
700 in the DVD encoder ECD. Note that the user-defined editing instructions in the DVD encoder ECD are a superset 
of the editing instructions of the authoring encoder EC described above. 

Specifically, the user-defined editing instructions (scenario data St7) in the DVD encoder ECD similarly describe 
what source data is selected from all or a subset of the source data containing plural titles within a defined time period, 

30 and how the selected source data is reassembled to reproduce the scenario (sequence) intended by the user. The sce- 
nario data St7 of the DVD encoder ECD, however, further contains such information as: the number of streams con- 
tained in the editing units, which are obtained by splitting a multi-title source stream into blocks at a constant time 
interval; the number of audio and sub-picture data cells contained in each stream, and the sub-picture display time and 
period; whether the title is a multi-rated title enabling parental lock control; whether the user content is selected from 

35 plural streams including, for example, multiple viewing angles; and the method of connecting scenes when the angle is 
switched among the multiple viewing angles. 

The scenario data St7 of the DVD encoder ECD also contains control information on a video object VOB unit basis. 
This information is required to encode the media source stream, and specifically includes such information as whether 
there are multiple angles or parental control features. When multiple angle viewing is enabled, the scenario data St7 

to also contains the encoding bit rate of each stream considering data interleaving and the disk capacity, the start and end 
times of each control, and whether a seamless connection should be made between the preceding and following 
streams. 

The encoding system controller 200 extracts this information from the scenario data St7, and generates the encod- 
ing information table and encoding parameters required for encoding control. The encoding information table and 
is encoding parameters are described with reference to Figs. 27, 28, and 29 below. 

The stream encoding data St33 contains the system stream encoding parameters and system encoding start and 
end timing values required by the DVD system to generate the VOBs. These system stream encoding parameters 
include the conditions for connecting one video object VOB with those before and after, the number of audio streams, 
the audio encoding information and audio Ids, the number of sub-pictures and the sub-picture Ids, the video playback 
>o starting time information VPTS. and the audio playback starting time information APTS. 

The title sequence control signal St39 supplies the multimedia bitstream MBS formatting start and end timing infor- 
mation and formatting parameters declaring the reproduction control information and interleave information. 

Based on the video encoding parameter and encoding start/end timing signal St9, the video encoder 300 encodes 
a specific part of the video stream St1 to generate an elementary stream conforming to the MPEG2 Video standard 
is defined in ISO-13818. This elementary stream is output to the video stream buffer 400 as encoded video stream St15. 

Note that while the video encoder 300 generates an elementary stream conforming to the MPEG2 Video standard 
defined in ISO-13818, specific encoding parameters are input via the video encoding parameter signal St9, including 
the encoding start and end timing, bit rate, the encoding conditions for the encoding start and end. the material type, 
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including whether the material is an NTSC or PAL video signal or telecine converted material, and whether the encoding 
mode is set for either open GOP or closed GOP encoding. 

The MPEG2 coding method is basically an interframe coding method using the correlation between frames for 
maximum signal compression, i.e., the frame being coded (the target frame) is coded by referencing frames before 

5 and/or after the target frame. However, intra-coded frames, i. e. , frames that are coded based solely on the content of 
the target frame, are also inserted to avoid error propagation and enable accessibility from mid-stream (random 
access). The coding unit containing at least one intra-coded frame ("intra-frame") is called a group_of_pictures GOP. 

A group_of_pictures GOP in which coding is closed completely within that GOP is known as a "closed GOP. " A 
group_of_pictures GOP containing a frame coded with reference to a frame in a preceding or following (ISO-13818 

10 DOES NOT LIMIT P- and B-picture CODING to referencing PAST frames) group_of_pictures GOP is an "open GOP." 
It is therefore possible to playback a closed GOP using only that GOP. Reproducing an open GOP, however, also 
requires the presence of the referenced GOP, generally the GOP preceding the open GOP. 

The GOP is often used as the access unit. For example, the GOP may be used as the playback start point for repro- 
ducing a title from the middle, as a transition point in a movie, or for fast-forward play and other special reproduction 

15 modes. High speed reproduction can be achieved in such cases by reproducing only the intra-frame coded frames in a 
GOP or by reproducing only frames in GOP units. 

Based on the sub-picture stream encoding parameter signal St1 1 , the sub-picture encoder 500 encodes a specif ic 
part of the sub-picture stream St3 to generate a variable length coded bitstream of bitmapped data. This variable length 
coded bitstream data is output as the encoded sub-picture stream St1 7 to the sub-picture stream buffer 600. 

20 Based on the audio encoding parameter signal St 13, the audio encoder 700 encodes a specific part of the audio 
stream St5 to generate the encoded audio data. This encoded audio data may be data based on the MPEG1 audio 
standard defined in ISO-11172 and the MPEG2 audio standard defined in ISO-13818, AC-3 audio data, or PCM 
(LPCM) data. Note that the methods and means of encoding audio data according to these standards are known and 
commonly available. 

25 The video stream buffer 400 is connected to the video encoder 300 and to the encoding system controller 200. The 
video stream buffer 400 stores the encoded video stream St1 5 input from the video encoder 300, and outputs the stored 
encoded video stream St15 as the time-delayed encoded video stream St27 based on the timing signal St21 supplied 
from the encoding system controller 200. 

The sub-picture stream buffer 600 is similarly connected to the sub-picture encoder 500 and to the encoding sys- 

30 tern controller 200. The sub-picture stream buffer 600 stores the encoded sub-picture stream St17 input from the sub- 
picture encoder 500, and then outputs the stored encoded sub-picture stream St1 7 as time-delayed encoded sub-pic- 
ture stream St29 based on the timing signal St23 supplied from the encoding system controller 200. 

The audio stream buffer 800 is similarly connected to the audio encoder 700 and to the encoding system controller 
200. The audio stream buffer 800 stores the encoded audio stream St19 input from the audio encoder 700, and then 

35 outputs the encoded audio stream St19 as the time<lelayed encoded audio stream St31 based on the timing signal 
St25 supplied from the encoding system controller 200. 

The system encoder 900 is connected to the video stream buffer 400, sub-picture stream buffer 600, audio stream 
buffer 800. and the encoding system controller 200, and is respectively supplied thereby with the time-delayed encoded 
video stream St27, time-delayed encoded sub-picture stream St29, time-delayed encoded audio stream St31, and the 

40 system stream encoding parameter data St33. Note that the system encoder 900 is a multiplexer that multiplexes the 
time-delayed streams St27, St29, and St31 based on the stream encoding data St33 (timing signal) to generate title 
editing units (VOBs) St35. 

The VOB buffer 1000 temporarily stores the video objects VOBs produced by the system encoder 900. The format- 
ter 1 100 reads the delayed video objects VOB from the VOB buffer 1000 based on the title sequence control signal St39 
45 to generate one video zone VZ, and adds the volume file structure VFS to generate the edited multimedia stream data 
St43. 

The multimedia bitstream MBS St43 edited according to the user-defined scenario is then sent to the recorder 
1200. The recorder 1200 processes the edited multimedia stream data St43 to the data stream St45 format of the 
recording medium M, and thus records the formatted data stream St45 to the recording medium M. 

50 

DVD decoder 

A preferred embodiment of a digital video disk system authoring decoder DCD in which the multimedia bitstream 
authoring system of the present invention is applied to a digital video disk system is described below and shown in Fig. 
55 26. The authoring decoder DCD applied to the digital video disk system, referred to below as a DVD decoder DCD, 
decodes the multimedia bitstream MBS edited using the DVD encoder ECD of the present invention, and recreates the 
content of each title according to the user-defined scenario. It will also be obvious that the multimedia bitstream St45 
encoded by this DVD encoder ECD is recorded to a digital video disk medium M. 
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The basic configuration of the DVD decoder DCD according to this embodiment is the same as that of the authoring 
decoder DC shown in Fig. 3. The differences are that a different video decoder 3801 (shown as 3800 in Fig. 26) is used 
in place of the video decoder 3800, and a reordering buffer 3300 and selector 3400 are disposed between the video 
decoder 3801 and synthesizer 3500. 
5 Note that the selector 3400 is connected to the synchronizer 2900, and is controlled by a switching signal St103. 

The operation of this DVD decoder DCD is therefore described below in comparison with the authoring decoder DC 
described above. 

As shown in Fig. 26. the DVD decoder DCD comprises a multimedia bitstream producer 2000, scenario selector 
2100, decoding system controller 2300, stream buffer 2400, system decoder 2500, video buffer 2600, sub-picture buffer 
10 2700, audio buffer 2800, synchronizer 2900, video decoder 3801 , reordering buffer 3300, sub-picture decoder 3100, 
audio decoder 3200, selector 3400, synthesizer 3500, video data output terminal 3600, and audio data output terminal 
3700. 

The bitstream producer 2000 comprises a recording media drive unit 2004 for driving the recording medium M; a 
reading head 2006 for reading the information recorded to the recording medium M and producing the binary read sig- 

15 nal St57; a signal processor 2008 for variously processing the read signal St57 to generate the reproduced bitstream 
St61 ; and a reproduction controller 2002. 

The reproduction controller 2002 is connected to the decoding system controller 2300 from which the multimedia 
bitstream reproduction control signal St53 is supplied, and in turn generates the reproduction control signals St55 and 
St59 respectively controlling the recording media drive unit (motor) 2004 and signal processor 2008. 

20 So that the user-defined video, sub-picture, and audio portions of the multimedia title edited by the authoring 
encoder EC are reproduced, the authoring decoder DC comprises a scenario selector 2100 for selecting and reproduc- 
ing the corresponding scenes (titles). The scenario selector 2100 then outputs the selected titles as scenario data to 
the DVD decoder DCD. 

The scenario selector 21 00 preferably comprises a keyboard, CPU, and monitor. Using the keyboard, the user then 

25 inputs the desired scenario based on the content of the scenario input by the DVD encoder ECD. Based on the key- 
board input, the CPU generates the scenario selection data St51 specifying the selected scenario. The scenario selec- 
tor 2100 is connected to the decoding system controller 2300 by an infrared communications device, for example, and 
inputs the generated scenario selection data St51 to the decoding system controller 2300. 

The stream buffer 2400 has a specific buffer capacity used to temporarily store the reproduced bitstream St61 input 

30 from the bitstream producer 2000, extract the volume file structure VFS, the initial synchronization data SCR (system 
clock reference) in each pack, and the VOBU control information (DSI) in the navigation pack NV, to generate the bit- 
stream control data St63. The stream buffer 2400 is also connected to the decoding system controller 2300, to which it 
supplies the generated bitstream control data St63. 

Based on the scenario selection data St51 supplied by the scenario selector 2100, the decoding system controller 

35 2300 then generates the bitstream reproduction control signal St53 controlling the operation of the bitstream producer 
2000. The decoding system controller 2300 also extracts the user-defined playback instruction data from the bitstream 
reproduction control signal St53, and generates the decoding information table required for decoding control. This 
decoding information table is described further below with reference to Figs. 62 and 63. The decoding system controller 
2300 also extracts the title information recorded to the optical disk M from the file data structure area FDS of the bit- 

40 stream control data St63 to generate the title information signal St200. Note that the extracted title information includes 
the video manager VMG, VTS information VTSI. the PGC information entries C_PBI #j, and the cell presentation time 
C_PBTM. 

Note that the bitstream control data St63 is generated in pack units as shown in Fig. 19, and is supplied from the 
stream buffer 2400 to the decoding system controller 2300, to which the stream buffer 2400 is connected. 
45 The synchronizer 2900 is connected to the decoding system controller 2300 from which it receives the system clock 
reference SCR contained in the synchronization control data St81 to set the internal system clock STC and supply the 
reset system clock St79 to the decoding system controller 2300. 

Based on this system dock St79. the decoding system controller 2300 also generates the stream read signal St65 
at a specific interval and outputs the read signal St65 to the stream buffer 2400. Note that the read unit in this case is 
so the pack. 

The method of generating the stream read signal St65 is described next. 

The decoding system controller 2300 compares the system clock reference SCR contained in the stream control 
data extracted from the stream buffer 2400 with the system clock St79 supplied from the synchronizer 2900. and gen- 
erates the read request signal St65 when the system clock St79 is greater than the system clock reference SCR of the 
55 bitstream control data St63. Pack transfers are controlled by executing this control process on a pack unit. 

Based on the scenario selection data St51. the decoding system controller 2300 generates the decoding signal 
St69 defining the stream Ids for the video, sub-picture, and audio brtstreams corresponding to the selected scenario, 
and outputs to the system decoder 2500. 
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When a title contains plural audio tracks, e.g. audio tracks in Japanese, English, French, and/or other languages, 
and plural sub-picture tracks for subtitles in Japanese. English, French, and/or other languages, for example, a discrete 
ID is assigned to each of the language tracks. As described above with reference to Fig. 19, a stream ID is assigned to 
the video data and MPEG audio data, and a substream ID is assigned to the sub-picture data, AC-3 audio data, linear 

5 PCM data, and navigation pack NV information. While the user need never be aware of these ID numbers, the user can 
select the language of the audio and/or subtitles using the scenario selector 2100. if English language audio is 
selected, for example, the ID corresponding to the English audio track is sent to the decoding system controller 2300 
as scenario selection data St51. The decoding system controller 2300 then adds this ID to the decoding signal St69 
output to the system decoder 2500. 

w Based on the instructions contained in the decoding signal St69, the system decoder 2500 respectively outputs the 
video, sub-picture, and audio bitstreams input from the stream buffer 2400 to the video buffer 2600, sub-picture buffer 
2700, and audio buffer 2800 as the encoded video stream St71 , encoded sub-picture stream St73, and encoded audio 
stream St75. Thus, when the stream ID input from the scenario selector 2100 and the pack ID input from the stream 
buffer 2400 match, the system decoder 2500 outputs the corresponding packs to the respective buffers (i.e., the video 

is buffer 2600, sub-picture buffer 2700, and audio buffer 2800). 

The system decoder 2500 detects the presentation time stamp PTS and decoding time stamp DTS of the smallest 
control unit in each bitstream St67 to generate the time information signal St77. This time information signal St77 is sup- 
plied to the synchronizer 2900 through the decoding system controller 2300 as the synchronization control data St81 . 
Based on this synchronization control data St81 , the synchronizer 2900 determines the decoding start timing 

20 whereby each of the bitstreams will be arranged in the correct sequence after decoding, and then generates and inputs 
the video stream decoding start signal St89 to the video decoder 3801 based on this decoding timing. The synchronizer 
2900 also generates and supplies the sub-picture decoding start signal St91 and audio stream decoding start signal 
St93 to the sub-picture decoder 3100 and audio decoder 3200, respectively. 

The video decoder 3801 generates the video output request signal St84 based on the video stream decoding start 

25 signal St89, and outputs to the video buffer 2600. In response to the video output request signal St84, the video buffer 
2600 outputs the video stream St83 to the video decoder 3801 . The video decoder 3801 thus detects the presentation 
time information contained in the video stream St83, and disables the video output request signal St84 when the length 
of the received video stream St83 is equivalent to the specified presentation time. A video stream equal in length to the 
specified presentation time is thus decoded by the video decoder 3801 , which outputs the reproduced video signal St95 

30 to the reordering buffer 3300 and selector 3400. 

Because the encoded video stream is coded using the interframe correlations between pictures, the coded order 
and display order do not necessarily match on a frame unit basis. The video cannot, therefore, be displayed in the 
decoded order. The decoded frames are therefore temporarily stored to the reordering buffer 3300. The synchronizer 
2900 therefore controls the switching signal Stl03 so that the reproduced video signal St95 output from the video 

35 decoder 3800 and the reordering buffer output St97 are appropriately selected and output in the display order to the 
synthesizer 3500. 

The sub-picture decoder 3100 similarly generates the sub-picture output request signal St86 based on the sub-pic- 
ture decoding start signal St91, and outputs to the sub-picture buffer 2700. In response to the sub-picture output 
request signal St86, the sub-picture buffer 2700 outputs the sub-picture stream St85 to the sub-picture decoder 3100. 

40 Based on the presentation time information contained in the sub-picture stream St85, the sub-picture decoder 3100 
decodes a length of the sub-picture stream St85 corresponding to the specified presentation time to reproduce and 
supply to the synthesizer 3500 the sub-picture signal St99. 

The synthesizer 3500 superimposes the selector 3400 output with the sub-picture signal St99 to generate and out- 
put the video signal St1 05 to the video data output terminal 3600. 

45 The audio decoder 3200 generates and supplies to the audio buffer 2800 the audio output request signal St88 
based on the audio stream decoding start signal St93. The audio buffer 2800 thus outputs the audio stream St87 to the 
audio decoder 3200. The audio decoder 3200 decodes a length of the audio stream St87 corresponding to the specif ied 
presentation time based on the presentation time information contained in the audio stream St87, and outputs the 
decoded audio stream St101 to the audio data output terminal 3700. 

so It is thus possible to reproduce a user-defined multimedia bitstream MBS in real-time according to a user-defined 
scenario. More specifically, each time the user selects a different scenario, the DVD decoder DCD is able to reproduce 
the title content desired by the user in the desired sequence by reproducing the multimedia bitstream MBS correspond- 
ing to the selected scenario. 

It should be noted that the decoding system controller 2300 may supply the title information signal St200 to the sce- 
55 nario selector 2100 by means of the infrared communications device mentioned above or another means. Interactive 
scenario selection controlled by the user can also be made possible by the scenario selector 2100 extracting the title 
information recorded to the optical disk M from the file data structure area FDS of the bitstream control data St63 con- 
tained in the title information signal St200, and displaying this title information on a display for user selection. 



21 



TTID-<EP 084 7200 A 1 I > 



EP 0 847 200 A1 

Note further, that the stream buffer 2400. video buffer 2600. sub-picture buffer 2700 audio ^f^ 2 ^.^^ 
derina buffer 3300 are expressed above and in the figures as separate entities because they are funcbonally different 
1&£££*££. that a single buffer memory can be controlled to provide the sarne d.scrje funct.ona.rty by 
2-share controlled use of a buffer memory wrth an operating speed plural times faster than the read and wrrte rates 
of these separate buffers. 

Multi-scene control 

The cone* 01 multiple angle scene control acco-dint, lo the present invention is descrihed beta. .with reteteooe 
to FiT 2 r^escobed above, titles that can be played back wtlh numerous uauattons a.e «">^ 
^otpUdstrontainlns^^ 

e and 7, between scenes 5 and 8. are the multi-scene periods. ranrr ^. 
^enesTaken from different angles, i.e.. angles 1 , 2. and 3 in this example, can be dynam.cally selected and repro- 

use?«n S^oZlkitfi seines, and scene 6 has been preselected for output in the parerrtal locked 

TZZ^Sw*^ «. — «• free, y select any of the mu,ti - an9,e scenes - and 506,16 7 

Dreselected for output in the parental locked scene period. M . _ Q prir , , 

Preference to Figs. 30 and 31. furthermore, the contents of the program cha.n 'nforrnaton VTS.PGCI ,s 
descSd In Ft Tso the "case that a scenario requested by the user is shown wrth respeetto a VTS. data ^uctiorv 
™7=renario 1 1 and scenario 2 shown in Fig. 21 are described as program cha.n information vTS_PGC#1 and 
VTS pS ^fSS^SmJLZ the scenario 1 consists of ceil playback information C_PBI#1 corresponding to 
s^e TTpbS I C PBI#3. and C_PB.#4 wrthin a murti-ang.e cell block. C_PB.#5 corresponding to scene 5. 
r PRiiifi corresoondina to scene 6. and C PBI#7 corresponding to scene 8. 

"™- ^SdlscrSng the scenario 2 consists of cell playback information C_PB.#1 corresporxl^ to scene 1. 
C PbT5 C PbT#3 and C PBI#4 wrthin a multi-angle cel. block corresponding to a mult-angle scene. ^™5oow- 
] ' ,„T„„qc rpmm corresoondina to scene 7 and C PBI#7 corresponding to scene 8. According to the dig- 

control unrt of a scenario is described as a cell which is a unit 

^ Ro" 3 Ttoe ^^Ts^r^Z^f^^ h H» 21 is shown wrth respect to a VOB data 
conl^on^^ 

SnShTwt to"a single scene commonly owned by each scenario. VOB#1 corresponding toscene 1 -^EMS cor- 
[SSoSg to^cene 5. and VOB#8 corresponding to scene 8 are arranged in non-,nterleaved block wh,ch .s the 

^Kre^Sto the multi-angle data commonly owned by scenarios 1 and 2. one angle scene date is constructed 
«■», « ^nole vSf SoSicX leaking angle 1 is constructed by VOB#2. and angle 2 is constructed by VOB#3. angle 
tiSS^S 1 !^^^^ multi-angle data is formed as the interleaved block for the sake 
fno^b^en each anSeWKd seamless reproduction of each angle data. Scenes 6 and 7 pecuUar to scenanos 1 and 2 
respeSy. ?e formed as the interleaved block for the sake of seamless reproduct.cn between common scenes before 
and behind thereof as well as seamless reproduction between each scene. 

As deJnSin toe above, the user's requesting scenario shown in Fig. 21 can be realized by iMzmg the v,deo 
trtto £££!Zm information shown in Fig. 30 and the title playback VOB data structure shown ,n F.g. 31 

Seamless playback 

The seamless playback capability briefly mentioned above wrth regard to the digital video disk system data itrue- 
tore fe MtadbSTSs thai seamless playback refers to the reproduction in a digital video djsk system ofmuto- 
mia SSZSTwh and sub-picture data without intermittent breaks in the data or ,rfformat,onl between 

ousl^oducS for a constant time unrt in order for the user to understand the content or .nforrnaton. e.g.. audio data. 
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data continuity is lost when the required continuous presentation time cannot be assured. Reproduction of such infor- 
mation whereby the required continuity is assured is referred to as "contiguous information reproduction," or "seamless 
information reproduction." Reproduction of this information when the required continuity cannot be assured is referred 
to as "non-continuous information reproduction," or "non-seamless information reproduction." It is obvious that contin- 
5 uous information reproduction and non-continuous information reproduction are, respectively, seamless and non-seam- 
less reproduction. 

Note that seamless reproduction can be further categorized as seamless data reproduction and seamless informa- 
tion reproduction. Seamless data reproduction is defined as preventing physical blanks or interruptions in the data play- 
back (intermittent reproduction) as a result of a buffer underflow state, for example. Seamless information reproduction 
w is defined as preventing apparent interruptions in the information when perceived by the user (intermittent presentation) 
when recognizing information from the playback data where there are no actual physical breaks in the data reproduc- 
tion. 

Details of Seamless playback 

15 

The specific method enabling seamless reproduction as thus described is described later below with reference to 
Figs. 23 and 24. 

Interleaving 

20 

The DVD data system streams described above are recorded using an appropriate authoring encoder EC as a 
movie or other multimedia title on a DVD recording medium. Note that the following description refers to a movie as the 
multimedia title being processed, but it will be obvious that the invention shall not be so limited. 

Supplying a single movie in a format enabling the movie to be used in plural different cultural regions or countries 
25 requires the script to be recorded in the various languages used in those regions or countries. It may even necessitate 
editing the content to conform to the mores and moral expectations of different cultures. Even using such a large-capac- 
ity storage system as the DVD system, however, it is necessary to reduce the bit rate, and therefore the image quality, 
if plural full-length titles edited from a single common source title are recorded to a single disk. This problem can be 
solved by recording the common parts of plural titles only once, and recording the segments different in each title for 
30 each different title only. This method makes it possible to record plural titles for different countries or cultures to a single 
optical disk without reducing the bit rate, and, therefore, retaining high image quality. 

As shown in Fig. 21 , the titles recorded to a single optical disk contain basic scene periods of scenes common to 
all scenarios, and multi-scene periods containing scenes specific to certain scenarios, to provide parental lock control 
and multi-angle scene control functions. 
35 In the case of the parental lock control function, titles containing sex scenes, violent scenes, or other scenes 
deemed unsuitable for children, i.e.. so-called "adult scenes." are recorded with a combination of common scenes, adult 
scenes, and children's scenes. These title streams are achieved by arraying the adurt and children's scenes to multi- 
scene periods between the common basic scene periods. 

Multi-angle control can be achieved in a conventional single-angle title by recording plural multimedia scenes 
40 obtained by recording the subjects from the desired plural camera angles to the multi-scene periods arrayed between 
the common basic scene periods. Note, however, that while these plural scenes are described here as scenes recorded 
from different camera angles (positions), it will be obvious that the scenes may be recorded from the same camera 
angle but at different times, data generated by computer graphics, or other video data. 

When data is shared between different scenarios of a single title, it is obviously necessary to move the laser beam 
45 LS from the common scene data to the non-common scene data during reproduction, i.e., to move the optical pickup to 
a different position on the DVD recording medium RC1 . The problem here is that the time required to move the optical 
pickup makes it difficult to continue reproduction without creating breaks in the audio or video, i.e., to sustain seamless 
reproduction. This problem can be theoretically solved by providing a track buffer (stream buffer 2400) to delay data out- 
put an amount equivalent to the worst access time. In general, data recorded to an optical disk is read by the optical 
so pickup, appropriately processed, and temporarily stored to the track buffer. The stored data is subsequently decoded 
and reproduced as video or audio data. 

Specific Problem in Interleaving 

55 The operation of the stream buffer 2400. or a track buffer, in this DVD system is described briefly below. 

Because rotational speed control of the optical disk drive cannot instantaneously adjust the disk speed, the input 
to the stream buffer 2400, i.e., the transfer rate Vr from the optical disk, is a basically constant rate. Output from the track 
buffer, i.e., the transfer rate Vo to the decoder, however, varies according to the image quality, user intent, and even the 
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variable bit rate of the compressed video data in the DVD system. The from-disk transfer rate Vr is therefore constant 
at approximately 11 Mbps in the DVD system, while the transfer rate Vo from the buffer is variable to a maximum 10 
Mbps. 

Because of this gap between the input and output transfer rates Vr and Vo, the stream buffer 2400 will overflow if 

s data is transferred continuously from disk Therefore, to prevent the stream buffer 2400 of the reproduction apparatus 
overflowing, data is transferred intermittently from the disk. The stream buffer is controlled during normal continuous 
reproduction to always be in a near-overflow state. 

It is therefore possible by using a stream buffer 2400 to sustain continuous, uninterrupted data output (reproduc- 
tion) even if there is a slight break in data reading as a result of the read head (optical pickup) 2006 jumping to move 

io between logic sectors LS on the disk. In an actual reproduction apparatus, however, the jump time may vary between 
200 msec and 2 sec depending upon the distance and position on the disk M. While it is possible to use a stream buffer 
2400 with capacity sufficient to absorb this jump time, the compression bit rate is an average 4 - 5 Mbps and a maximum 
10 Mbps when using a large-capacity optical disk M storing high quality images. Assuring seamless reproduction when 
jumping from any disk position to any other with this system imposes significant memory requirements, resulting in an 

is expensive decoder DC. Because the amount of memory that can be built in to the decoder DC is therefore limited if a 
cost-effective product is to be marketed, other limitations are imposed, e. g., limiting the jump time to a time whereby 
data can be continuously reproduced. 

The relationship between the accumulated data quantity in the stream buffer 2400 and the operating mode of the 
reading head 2006 is shown in Fig. 32. In Fig. 32, period Tr is the period during which the optical pickup reads data from 

20 the optical disk RC, and period Tj is the jump period during which the optical pickup moves between logic sectors. Line 
L1 expresses the change in data volume Vd stored in the stream buffer 2400 during the data read period Tr. Line L2 
expresses the change in data volume Vd stored in the stream buffer 2400 during the jump period Tj. 

During the data read period Tr, the reading head 2006 reads data from the optical disk M at the transfer rate Vr, and 
stores the read data directly to the stream buffer 2400. The stream buffer 2400 supplies the data to the decoders 3801 , 

25 3100, and 3200 at the transfer rate Vo. As a result, data accumulates in the stream buffer 2400 during the data read 
period Tr at the rate Vr-Vo, i.e., the difference between these two transfer rates. 

Because the reading head 2006 is moving to access a different disk position during the jump period Tj, data read 
from the optical disk M cannot be stored to the stream buffer 2400. Data continues to be output from the stream buffer 
2400 to the decoders 3801 , 3100, and 3200 during this period, however, and the accumulated data volume Vd remain- 

30 ing in the stream buffer 2400 decreases at the rate at which data is transferred to the decoders, i.e.. transfer rate Vo. 

Note that while the transfer rate Vo to the decoders is shown in Fig. 32 as a constant rate, data transfers to the 
decoders actually occur intermittently because the decoding time varies according to the data type. These differences 
are omitted to simply this description of the buffer underflow concept. This is equivalent to the reading head 2006 read- 
ing data intermittently during the jump period Tj, even through the reading head 2006 continuously reads from the opti- 

35 cal disk M at a constant linear velocity (CLV). The slopes LI and L2 of lines LI and L2 above can be expressed by the 
following equations. 

L1= Vr-Vo [1] 
40 L2 = Vo [2] 

Underflow occurs, and decoding stops, when the jump period Tj is long enough for stream buffer 2400 to empty. If 
the jump period Tj is held to less than the time required for the stream buffer 2400 to empty, however, decoding can con- 
tinue without data interruptions. This time at which the reading head 2006 can jump to another disk position without a 
45 data underflow state occurring in the stream buffer 2400 is referred to as the jump-enabled time. 

Note that while physical movement of the reading head 2006 has been described as a factor causing a data under- 
flow state in the stream buffer 2400. the following factors also contribute. Note, further, that this data underflow state 
can be simply characterized as a condition in which data input to the track buffer (stream buffer 2400) does not keep 
pace with data output from the track buffer. 

50 

a) The buffer is too small relative to the decoding speed of the decoder. 

b) The sizes of the individual input units of the plural VOB types carried in the reproduced bitstream St61 input to 
the stream buffer 2400 from the bitstream producer 2000 are inappropriate for the buffer size. 

c) The sequence of the individual input units of the plural VOB types carried in the reproduced bitstream St61 is 
>5 inappropriate for the decoding speed, resulting in the next data unit to be decoded not being received in sufficient 

time while decoding a particular data unit. 

When the reproducing apparatus of the digital video disk system is configured with a disk read rate of 1 1 Mbps. a 
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maximum AV data compression rate of 10 Mbps, and a track buffer (stream buffer 2400) capacity of 4 Mbits, for exam- 
ple, a data underflow state will occur. To prevent this data underflow state from occurring during the jump period Tj in a 
reproduction apparatus thus configured with slight overflow control maintained during normal continuous playback, a 
maximum jump-enabled time of 400 msec can be assured even in a worst-case scenario where 10 Mbps of AV data is 
reproduced during the jump period. 

Note that a jump-enabled time of 400 msec is a realistic value that can be achieved in actual reproduction appara- 
tuses. In such an apparatus the distance that can be jumped during a 400 msec period is approximately 500 tracks. It 
follows that the jump-enabled time can also be defined in data quantity terms, specifically the volume of the sequential 
data stream that the reading head 2006 can move through within the jump-enabled time. The data volume equivalent 
to a 400 msec jump-enabled time is approximately 250 Mbits. It will also be obvious that the actual distance in sector 
and track units on the recording medium can also be easily obtained from the data volume defined as the jumpable dis- 
tance based on the recording method and density of the recording medium. 

A 250-Mbit jumpable distance is equivalent to a 50 sec. presentation time assuming AV data reproduced at an aver- 
age 5 Mbits/sec. With even higher quality AV data, the presentation time is less than 50 sec. In movies and other data 
streams edited to remove particular scenes due to cultural, educational, or other reasons, the excised scenes are typi- 
cally long, usually ranging from 2 to 5 minutes, and even as long as 10 minutes. If the excised scene lasts 5 minutes, 
for example, with the above reproduction apparatus, it is not possible to uninterruptedly connect the scenes before and 
after an excised scene if the before scene, excised scene, and after scene are recorded to disk in a single contiguous 
series. More specifically, the data corresponding to a continuous 5 minute scene cannot be jumped in a single jump 
operation. 

There are also cases in which the AV data compression rate, i.e., the consumption rate Vo of data from the track 
buffer, approaches 10 Mbps and non-occurrence of a buffer underflow state cannot be assured, even when the exdsed 
scene data is jumped using a jump period Tj exceeding 400 msec. While such a data underflow state can conceivably 
be avoided by recording two AV data scenarios, i.e., excised (edited) and un-excised (unedited) scenarios, to disk, this 
method does not make effective use of the available disk capacity. If it is also necessary to record duplicate data for 
much presentation time, the AV data will be low quality and it will be difficult to satisfy most user expectations. 

The concept of sharing data between plural titles is described below with reference to Fig. 33 wherein TL1 and TL2 
refer to the data content of a first and second title, respectively. Specifically, the first title TL1 consists of data DbA, DbB, 
and DbD reproduced contiguously during time T, and the second title TL2 similarly consists of data DbA, DbC, and DbD 
reproduced contiguously during time T. Note that data DbA, DbB, DbC, and DbD are video objects VOB with display 
times T1, T2, T2, and T3, respectively. When two titles TL1 and TL2 thus comprised are recorded to disk, the data 
stream is structured with data DbA and DbD as shared data of both titles TL1 and TL2, and data DbB and DbC as the 
discrete data of titles TL1 and TL2, respectively, selectively reproduced during selection period T2 as shown by title 
TL1_2. Note that while there appears to be a time gap between the data blocks in Fig. 33, this is simply to fadlitate the 
use of arrows in the figure for easier description of the data reproduction paths. There is no actual time gap during play- 
back. 

The format whereby these data blocks DbA, DbB, DbC, and DbD are recorded to disk in title TL1_2 to enable con- 
tiguous reproduction of the data is shown in Fig. 34. In principle, data blocks DbA, DbB, DbC, and DbD composing a 
single contiguous title are recorded to contiguous segments of the data recording track TR (Fig. 9). Thus, data DbA, 
DbB, and DbD of first title TL1 are first recorded contiguously, and are followed by data DbC unique to the second title 
TL2. With this data format, the first title TL1 can be contiguously reproduced without breaks in the title content, i.e., 
seamlessly, by means of controlling the reading head 2006 to access data DbA, DbB, and DbD synchronized to play- 
back periods T1 , T2, and T3, respectively. 

To reproduce the second title TL2, however, the reading head 2006 must jump past data DbB and DbD after repro- 
ducing data DbA in period T1, and must reach data DbC before the start of period T2 as shown by arrow Sq2a in Rg. 
34. Then after reprodudng data DbC, the reading head 2006 must jump back to data units DbC and DbD as shown by 
arrow Sq2b, and reach data DbD before the start of period T3. Because of the time required to move the reading head 
2006 between data blocks, it is not possible to assure seamless reproduction between data DbA and DbC. and between 
DbC and DbD. unless the distance traveled between these data blocks requires less time than the time required for a 
data underflow state to occur in the stream buffer 2400. 

Definition qf Interleaving 

To thus enable the user to selectively exdse scenes and choose from among plural scenes, a state wherein non- 
selected scene data is recorded inserted between common scene data and selective scene data necessarily occurs 
because the data units associated with individual scenes are contiguously recorded to the recording tracks of the 
recording medium. If data is then read in the recorded sequence, non-selected scene data must be accessed before 
accessing and decoding the selected scene data, and seamless connections with the selected scene is difficult. The 
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excellent random access characteristics of the digits, video disK system, however, make seamless connections with the 

selected scenes possible. «~™iioH riata <^ize and interleaving plural split 

,n other words, by splitting scene-specific data into plural «*£"P£j£ 2££ range whereby an 

data units for different scenes in a P^|^ JJ^^i^SSS sSnefw^hout d^S intenuptions by inter- 

ntent^ 
reproduction is thereby assured. 

notaiiad Def inr*™ of mtPrleavina 

The seamless connection method and data splitting and anaying concepts of the present ^nvention are described 
below^^ 

Referring again to Fig. 32. the data consumption rate Vc ns less than js then ^^ed 

ence. a known data volume is read at the transfer rate Vr. buffered to ^dk^^anos underflow 

interleave units after interleaving are called "interleaved m , ural video a>)e<te vOBs 

leave distance. --rw,* ,«ina an cotical disk recording medium. If this 10.000- 

For example. 260 msec is required to move 0.000 sectore using an 5 y , LVU ^ be dete r- 

sector movement of the optical pickup is *e.nterieave unri djsterrce Tr an?Vo, and the size of the track 

mined from the difference between the track buHennptf and using an input transfer rate 

buffer. Let us assume, for example "the distance between 

Vr °V i rT,LV a SrsToCos re e2^ 

in general. 2080 Kbits is equivalent to 260 msec of playback date. T °*™™™l r tor 0 . 7 sec . at the difference 
"^between — 

between transfer rates Vr and Vo (- 208C «> « ™" > consumption by the decoder while the 

SX™g^h^ 

presentation time, this makes it possible to .nterleave ""to w*. a g^J^JJ^,^ corlumption bit rate. 

Si. to increase the iumpable time whanging »e inter ^ ^ A cQn . 

One method of connecting scenes ,s shown in F.g_35 where ^the toHow. g q 
nected to scene D; scene A connected to scene B^s J^^^^J^i * sc ? ne D but for a different 
is scene A connected to scene C. which ,s simHarly ^^^^ pigments (scenes) D-1 , D-2. and D- 

;o be jumped between connected scenes as described above B c> and q in this exam- 

However, when scenes with the same starting points ^^^^^t^^r^Sng to scene D-1 . 
p,e. are interleaved, three different system f^^^^^X^neot and procLing tends to 
two system streams become interleaved dunng ^ p«d» res ^^ d ^~ VOBtoirterleave Ejects having 
become complex, It is therefore more common .^ en ^' e ^ ,n 9 £^Lr.shed as shown in Rg. 36 by copy- 

* the same starting and end poir^thus jmpl^ng p ££2ich plural scenes dh,erge and con- 
ing and appending scent iMJ scene C *^^.™2S^ objects VOB. When interleaving branching and 

r^SSS 2 SSS - — a ' e ~— - the stertin9 - , end 
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points thereof aligned. 

The interleaving concept is described in <^«**^*"V exgmp|e o{ Cleaving with time information. This 
The audio-video (AV) system stream described above s one ^example cnm y to tne buffer at 

interring method arrays audio and video ^^^^^"SS^ll- **™ presentation time 
approximately the same time is in physical P roxim *» a ^^^ scene in titles such as movies, 

ane arrayed in alternating sequence. When one scene m ibe. wpjwd «m a interleaving method 

however the time between these plural scenes often d ^J™^ ™ ™ Reabsorbed by the buffer if it is within 
£ above is applied in such cases the ^™^,^S£ hoJeve, the buffer cannot 
the jumpable time. H the between-scene time d.fference is greater than jump 

absorb the difference and seamless playback .s not be accumu , a ted at one time, a longer 
If the track buffer size is increased to increase the amount of da ^ mat can interactive opera- 

jump aSe time is obtained, and the interleave unrt ;"^|^ «• ^ie^fwih seamless playback, the pres- 
ion enabling the user to select from among P^^^^^SSS) is changed » the interleaved units ILVU 
T^C^t^T^^ £ i^id. Th^so resuKs in more time being required 
for the displayed stream to change. e * roam ar ranaement in data block units to prevent a data 

j-» d =«^= :c - - — - - ^ *- 

,ess*S>rs such as the decoding speed «^^ g ,n * < JT B Smeope=e l c.eK M .oreaddet.. 
opScal pickup ie e problem when the «M J^^^SSSTSi* TR 0. the optioal disk M. the decoding 
SeneaumgSalso needed «hen recording '^^^^^^^s^sjeamie supped direOly as cKXturs 
speed ol the immunisations system can ^*^E££IE£ hroadcast eateWtes aod other radio wave 

Svi^o^m^^^^ 

^source oata groups otpiural t»|*^^^SX^^ ta *-- 
5te ^WM^SCdSi — ^ in the input o. the target source data that 

It shcild he noted, however, that en intedea tnnoj m«hod t» „ sceoanos in a manner 

^eoTe^^ 
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seamless playback. video obie cts are distributed to positions (navigation 

When applied to a DVD system as d -?*«^^*J^^S of video data compression. The GOP data 
packs NV) within a specific time-base range bounded byGOF ' p rocesses or infra-frame coding inserted for 

length, however, may vary as a result of user-defmec ^^^'^^^gation packs NV). which are depend- 
hX image quality. This means that the changes or the optica, 

ent upon the presentation time, can vary. This means that the point at jf , , ang ,es are inter- 

P^uplumpstUenex^^ is known. Thus, the data 

leaved together, the length of contiguously read date _,s ^ndef mrte ^ ' ^ data is thus dela yed. 

wfth re,erence to the accompany,n9 f ,gures - 



m ^ax/ed Hq \+ fT 1 UtifiUgayS unit 
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The interleaving me*=d en**,g — -» "S^S^^SSSSS-^ 
below with reference to Fig. 24 and Fig. 37. Shown m Rg. 24 is V^L^JoB_g VOB . c , and VOB-D. and then merging 

.... . Q7 



TR on disk is shown in Fig. 37. 
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Referring to Fig. 37, VOB-A and VOB-E are video objects with independent playback start and end times, and are 
in principle arrayed to contiguous block regions. As shown in Fig. 24, the playback start and end times of VOB-B, VOB- 
C, and VOB-D are aligned during interleaving. The interleaved data blocks are then recorded to disk to a contiguous 
interleaved block region. The contiguous block regions and interleaved block regions are then written to disk in the track 
5 path Dr direction in the playback sequence. Plural video objects VOB, i.e., interleaved video objects VOBS, arrayed to 
the data recording track TR are shown in Fig. 37. 

Referring to Fig. 37, data regions to which data is continuously arrayed are called "blocks," of which there are two 
types: "contiguous block regions" in which VOB with discrete starting and end points are contiguously arrayed, and 
"interleaved block regions" in which plural VOB with aligned starting and end points are interleaved. The respective 
w blocks are arrayed as shown in Fig. 38 in the playback sequence, i.e., block 1, block 2, block 3, . . . block 7. 

As shown in Fig. 38, the VTS title VOBS (VTSTT_VOBS) consist of blocks 1 - 7, inclusive. Block 1 contains VOB 1 
alone. Blocks 2, 3, 5, and 7 similarly discretely contain VOBS 2, 3, 6, and 10. Blocks 2, 3, 5, and 7 are thus contiguous 
block regions. 

Block 4, however, contains VOB 4 and VOB 5 interleaved together, while block 6 contains VOB 7, VOB 8, and VOB 

75 9 interleaved together. Blocks 4 and 6 are thus interleaved block regions. 

The internal data structure of the contiguous block regions is shown in Fig. 39 with VOB-i and VOB-j arrayed as the 
contiguous blocks in the VOBs. As described with reference to Fig. 16, VOB-i and VOB-j inside the contiguous block 
regions are further logically divided into cells as the playback unit. Both VOB-i and VOB-j in this figure are shown com- 
prising three cells CELL #1 , CELL #2, and CELL #3. 

20 Each cell comprises one or more video object unit VOBU with the video object unit VOBU defining the boundaries 
of the cell. Each cell also contains information identifying the position of the cell in the program chain PGC (the playback 
control information of the digital video disk system). More specifically, this position information is the address of the first 
and last VOBU in the cell. As also shown in Fig. 39, these VOB and the cells defined therein are also recorded to a con- 
tiguous block region so that contiguous blocks are contiguously reproduced. Reproducing these contiguous blocks is 

25 therefore no problem. 

The internal data structure of the interleaved block regions is shown in Fig. 40. In the interleaved block regions each 
video object VOB is divided into interleaved units ILVU. and the interleaved units ILVU associated with each VOB are 
alternately arrayed. Cell boundaries are defined independently of the interleaved units ILVU. For example, VOB-k is 
divided into four interleaved units ILVUkl , ILVUk2, ILVUk3, and ILVUk4, and are confined by a single cell CELL#k. VOB- 
30 k is likewise divided into four interleaved units ILVUml , ILVUm2, ILVUm3, and ILVUm4, and is confined by a single cell 
CELL#m. Note that instead of a single cell CELL#k or CELL#m, each of VOB-k and VOB-m can be divided into more 
than two cells. The interleaved units ILVU thus contains both audio and video data. 

In the example shown in Fig. 40, the interleaved units ILVUkl, ILVUk2, ILVUk3, and ILVUk4, and ILVUml, ILVUm2, 
ILVUm3, and ILVUm4, from two different video objects VOB-k and VOB-m are alternately arrayed within a single inter- 
35 leaved block. By interleaving the interleaved units ILVU of two video objects VOB in this sequence, it is possible to 
achieve seamless reproduction branching from one scene to one of plural scenes, and from one of plural scenes to one 
scene. This interleaving process further enables seamless reproduction of scenes from various scenario threads in 
most cases. 

40 An alternative for achiev ing Interleaving 

Seamless reproduction is even possible when there are three possible scenario threads as shown in Fig. 35, i.e., 
from scene A to scene B, and then to scene D-3 starting at some midpoint in scene D; from scene A to the beginning 
of scene D; and from scene A to scene C, and then to scene D-2 starting at a midpoint in scene D different from D-3. 
45 As shown in Fig. 36, it is also possible by connecting before and after scenes (scene D-2) to align the starting and end 
points according to the data structure of the present invention. Note that this process of copying scenes to align the 
starting and end points makes it possible to seamlessly reproduce even complicated threading of different scenes. 

An interleaving algorithm compatible with variable leng th coded data 

50 

An example of an interleaving algorithm compatible with variable length coded data, i.e., video data, is described 
next 

When plural VOB are interleaved, each VOB is basically divided into the same particular number of interleave units. 
It is also possible to obtain the data volume (size) of each of this particular number of interleave units based on the bit 
;5 rate of the VOB to be interleaved, the jump time and distance that can be jumped within this jump time, the track buffer 
capacity and the input rate Vr to the track buffer, and the position of the video object unit VOBU. Each interleaved unit 
consists of video object units VOBU. each of which comprises one or more MPEG GOP with a data quantity equivalent 
to a 0.4 - 1.0 second presentation time. 
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When data is interleaved, the interleaved units ILVU forming separate video objects VOB are alternately arrayed. If 
any of the plural interleave units interleaved to the shortest of plural VOB is shorter than the minimum interleave unit 
length, or if the total length of plural interleave units in any but the shortest of plural VOB, is greater than the shortest 
interleave distance, an underflow state will occur when this shortest interleave VOB is reproduced, and non-seamless 
5 reproduction will result instead of seamless reproduction. 

Before encoding occurs in the present embodiment it is therefore determined whether interleaving is possible. 
Whether interleaving is possible can be determined from the lengths of the pre-encoded streams. Because the effect 
of interleaving can thus be determined in advance, the need for data reprocessing, i.e., the need to adjust the interleav- 
ing conditions and record after interleaving and encoding are once completed, can be prevented. 
10 The interleaving method for recording to an optical disk according to the present invention is described specifically 
below, starting with the bit rate of the recorded VOB and the performance and other conditions of the reproduced disk. 

When interleaving is applied, the relationship between the track buffer input transfer rate Vr and output transfer rate 
Vo is defined as Vr > Vo as previously described. The maximum bit rate of each VOB to be interleaved is therefore set 
to less' than the track buffer input transfer rate Vr. The maximum bit rate B of each VOB is therefore set to a value less 
is than Vr. If in the evaluation determining whether interleaving enabling seamless reproduction is possible it is assumed 
that all plural VOB to be interleaved are encoded with constant bit rate (CBR) coding at the maximum bit rate, the inter- 
leave unit data volume is maximized, the time that can be reproduced from the data volume that can be placed at the 
jumpable distance is shortened, and the interleave conditions are stringent. 

Note that it is assumed below that each VOB is encoded with constant bit rate (CBR) coding at the maximum bit 

20 rate. 

It is assumed in the following description of the reproduction apparatus that the distance the optical pickup can 
jump within disk jump time JT expressed as a data quantity is jumpable distance JM, and the input data bit rate to the 
track buffer of the reproduction apparatus is BIT. 

In terms of an actual reproduction apparatus, the disk jump time JT is 400 msec, the jumpable distance JM is 250 
25 Mbit in this jump time JT. The maximum bit rate B of the VOB in the MPEG system is defined as 8.8 Mbps considering 
that an average 6 Mbps is required to obtain image quality superior to that of a conventional VCR recording. 

Based on the jumpable distance JM, jump time JT. and data read times from disk, the target values for the smallest 
interleaved unit size ILVUM and the smallest interleave unit presentation time ILVUMT are calculated first. The smallest 
interleave unit presentation time ILVUMT can be obtained from equation 3, and the smallest interleaved unit size ILVUM 
30 can be obtained from equation 4 below. 

ILVUMT ;> JT + ILV UM/BIT [3] 
ILVUMT x B = ILVUM [4] 

35 

From equation 3 the smallest interleave unit presentation time ILVUMT is determined to be 2 sec, and the smallest 
GOP block data GM is 1 7.6 Mbit. Thus, if the smallest interleave unit, which is the smallest unit of the data layout, is a 
2-sec. equivalent data quantity and each GOP comprises 1 5 frames (NTSC format), then the smallest interleave unit is 
equivalent to the data stored in four GOP. 

40 As also stated above, one interleaving condition is that the interleave distance be less than or equal to the jumpable 
distance. This means that except for the VOB with the shortest presentation time, the total presentation time of plural 
VOB to which interleaving is applied must be shorter than the time that can be reproduced within the interleave dis- 
tance. If as in the above example the jumpable distance JM is 250 Mbit and the maximum VOB bit rate is 8.8 Mbps, the 
time JMT that can be reproduced with the data quantity read within the jumpable distant JM is 28.4 sec. It is then pos- 

45 sible to calculate an interleavable conditions equation using these values. If each VOB in the interleaved block region 
is divided into the same number of interleaved block and this number of interleaved divisions is v, equation 5 can be 
obtained from the conditions of the smallest interleave unit length. 

(presentation time of shortest VOB)/lLV UMT ^ v [5] 

50 

Equation 6 can then also be obtained from the jumpable presentation time conditions. 

v < (presentation time of all VOB except shortest VOB)/JMT [6] 

55 Plural VOB can in principle be interleaved if the above conditions are fulfilled. In practical terms, VOBU compensa- 
tion to the value calculated according to the above equations is necessary because the interleave units are limited to 
the VOBU boundaries. To compensate equations 2, 3, and 4, it is necessary to add the maximum VOBU time (1 . 0 sec.) 
to the smallest interleave unit presentation time ILVUMT, and reduce the presentation time in the interleave distance 
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the conditions for interleaving the VOB scenes before encoding, H is necessary to increase the number rirtMM unit 
ShtamTM means increasing the length of the scene forming the shortest VOB by mov,ng the or 
tZ also neSssary to add the same scene added to the shortest scene to the other scenes. In general, the inter- 
ne diia*erSi«nt»; greater than the shortest interleave unit, and the rate of increase in the value on the teft 
sTof^uTtfon 4 Ts gSte'tnan the increase in the value on the right side of equation 6. As a result, .ncreas.ng the 
Qizp of the shifted (moved) scene can result in satisfying the above conditions. h^hkoH 

^^SSthr^ca, pic** may lump im-neofcte,, arte, enterin, an heaved bock .torn , > 

thS No^rwhi^he above operation has assumed that a.. VOB are divided into the same number of interleave units 
it is SJSttt) group the VOB into a group of VOB containing u drvisions (interleave un,ts) and a group of VOB 

u arising H2 XT or a„ VOB from which a greater number of interleave units cannot be obtained L and us.ng the 
numoer onnterle^ve units obtained from equation 4 up to (u + 1 ) for larger VOB. Th,s .s .llustrated .n F.g^t 

^e^aS *uXe of the interleaved unit ILVU in another embodiment of the invent,on .s shown ^ 42. In ttvs 
e m .^r^r n avtaa«on pack NV described with reference to Fig. 20 is placed at the beginning of each v«Jeo object unrt 
VObU anS th™ ^d?rZ one na V j J tion pack NV to immediately before the next navigation pack NV is defined as one 
1« vOBU T^etideo object unit VOBU boundaries determine the boundaries of the .nterleaved unrt ILVU. 

ea^o^ 

^ Tach^f r^rS"^ a navigaion pack NV (management .nfor^ <« pad* 
each nal^n pa* Containing an ILVU end pack address ILVU.EA indicating the address of the lad ips* Mn the 

from the navigation pack NV containing the address ^ r ™*o n . 

When a video object unit VOBU is in an interleaved block region, the next-ILVU start address im i_ilvu^va m 
to the^rt adi^ ofte next ILVU, simi.arly expressed as the number of sectors from the nav,gafcon pack NV con- 

^S^JSX reading the first navigation pack NV data in the interred unit ILVU to obtain Jje ^ 
t^li^n^M^edo^ ILVU to be reproduced, and to what disk position the current .nterleaved unrt ILVU* writ- 
S 11Z n^TiossSe to on.y read ^interleaved units .LVU. and to smoothly jump to the next .nterleaved unrt 
ILVU. 

M^ilfi-<;r.ene control 

The multi-scene period is described together with the concept of multi-scene control according to the present 

even bfco^er gZhics data. The multi-angle scene periods may therefore also be caUed mult,-scene penods 
Parental control 

The concept ol recording plural Sties composing alternative scenes to, such amotions as parental lock control and 
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scenes" containing sex, violence, or other scenes deemed unsuitable for children are contained in a title implementing 
parental lock control, the title stream is recorded with a combination of common system streams SSa, SSb, and Sse, 
an adult-oriented system stream SSc containing the adult scenes, and a child-oriented system stream SSd containing 
only the scenes suitable for children. Title streams such as this are recorded as a multi-scene system stream containing 
the adult-oriented system stream Ssc and the child-oriented system stream Ssd arrayed to the multi-scene period 
between common system streams Ssb and Sse. 

The relationship between each of the component titles and the system stream recorded to the program chain PGC 
of a title stream thus comprised is described below. 

The adult-oriented title program chain PGC1 comprises in sequence the common system streams Ssa and Ssb. 
the adult-oriented system stream Ssc, and the common system stream Sse. The child-oriented title program chain 
PGC2 comprises in sequence the common system streams Ssa and Ssb, the child-oriented system stream Ssd, and 
the common system stream Sse. 

By thus arraying the adult-oriented system stream Ssc and child-oriented system stream Ssd to a multi-scene 
period, the decoding method previously described can reproduce the title containing adult-oriented content by repro- 
ducing the common system streams Ssa and Ssb. then selecting and reproducing the adult-oriented system stream 
Ssc. and then reproducing the common system stream Sse as instructed by the adult-oriented title program chain 
PGC1. By alternatively following the child-oriented title program chain PGC2 and selecting the child-oriented system 
stream Ssd in the multi-scene period, a child-oriented title from which the adult-oriented scenes have been expurgated 
can be reproduced. 

This method of providing in the title stream a multi-scene period containing plural alternative scenes, selecting 
which of the scenes in the multi-scene period are to be reproduced before playback begins, and generating plural titles 
containing essentially the same title content but different scenes in part, is called parental lock control. 

Note that parental lock control is so named because of the perceived need to protect children from undesirable con- 
tent. From the perspective of system stream processing, however, parental lock control is a technology for statically 
generating different title streams by means of the user pre-selecting specific scenes from a multi-scene period. Note, 
further, that this contrasts with multi-angle scene control, which is a technology for dynamically changing the content of 
a single title by means of the user selecting scenes from the multi-scene period freely and in real-time during title play- 
back. 

This parental lock control technology can also be used to enable title stream editing such as when making the 
director's cut. The director's cut refers to the process of editing certain scenes from a movie to, for example, shorten the 
total presentation time. This may be necessary, for example, to edit a feature-length movie for viewing on an airplane 
where the presentation time is too long for viewing within the flight time or certain content may not be acceptable. The 
movie director thus determines which scenes may be cut to shorten the movie. The title can then be recorded with both 
a full-length, unedited system stream and an edited system stream in which the edited scenes are recorded to multi- 
scene periods. At the transition from one system stream to another system stream in such applications, parental lock 
control must be able to maintain smooth playback image output. More specifically, seamless data reproduction whereby 
a data underflow state does not occur in the audio, video, or other buffers, and seamless information reproduction 
whereby no unnatural interruptions are audibly or visibly perceived in the audio and video playback, are necessary. 



Multi-angle control 

The concept of multi-angle scene control in the present invention is described next with reference to Fig. 44. In gen- 
eral multimedia titles are obtained by recording both the audio and video information (collectively "recording" below) of 
the subject over time T The angled scene blocks #SC1, #SM1, #SM2, #SM3, and #SC3 represent the multimedia 
scenes obtained at recording unit times T1 , T2. and T3 by recording the subject at respective camera angles. Scenes 
#SM1 , #SM2, and #SM3 are recorded at mutually different (first, second, and third) camera angles during recording unit 
time T2, and are referenced below as the first, second, and third angled scenes. 

Note that the multi-scene periods referenced herein are basically assumed to comprise scenes recorded from dif- 
ferent angles. The scenes may, however, be recorded from the same angle but at different times, or they may be com- 
puter graphics data. The multi-angle scene periods are thus the multi-scene periods from which plural scenes can be 
selected for presentation in the same time period, whether or not the scenes are actually recorded at different camera 
angles. 

Scenes #SC1 and #SC3 are scenes recorded at the same common camera angle during recording unit times T1 
and T3, i.e., before and after the murti-angle scenes. These scenes are therefore called "common angle scenes." Note 
that one of the multiple camera angles used in the multi-angle scenes is usually the same as the common camera 
angle. 

To understand the relationship between these various angled scenes, multi-angle scene control is described below 
using a live broadcast of a baseball game for example only. 
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The common angle scenes #SC1 and #SC3 are receded at the common camera angle, which is here defined as 
the view from center field on the axis through the pitcher, batter, and catcher. 

The first angled scene #SM1 is record* at *. first the second mutt- 

on the axis through the catcher, prtcher. and batter. The secorx ang '^ene #*m^ s 
camera angle, i.e.. the view f^^ 

angled scene #SM2 is thus the same as the common famera angle dj unjt time T2 . The third 

S^sc^^^^ 
"^esen^^ 

period is called the "multi-angle scene period. By ^^f^™"™^*^ position to enjoy a different 

seleSng different angled scenesjhere is no actual ^^Hl!2fSi«on is described next with reference to 
Multi-angle scene control of the system st eam ^^ 0 " to common angle scene 

Fig. 23 from the perspective of connect ng , ^^J^^^fe^ unit times T1 and T3 are 

#SC is referenced as common angle data ^^J^TSlSwS^ to the muttiple angle scenes #SM1. 

the multiple angle .data unite L MA1 ^ s ^However. intermittent breaks in the playback information can result 
25 In the case of an MPEG system stream, nowev* . Qj n ^ ^tenl of the data at the connec- 

between the reproduced common and multiple angle da ta un rtsd, p ero in^up date ^ ^ {jrst 

reproduction results. cranes is selectively reproduced from the multi-angle 

Dieted before rerxoducrjor. ol the preceding common angle dale BAIt « comre«ea >e > 

fo change ,o e drKeren, angi e da* unKM« *^«^Se 'JSS fiSKS «nd ft. data oreah pomts 

r^rrerri^ 
- c=^=^r^s^ 

We within this GOP processing unit selected before reproduction reaches the multi-angle scene 

In other words, if the desired angle data. e. g., MA3, '^elected D«ore rep completed, the angle data 

period, and at the latest by the time reproducing .extremely difficult while 
selected from within ™^ the same multi-angle scene period^t 

Sete^ 

pie. a view from a different camera angle. . described in further detail 

The method of sw*chN , between s-e -gjs ^ 

MA1. S.A2. and MA3 shown in F,gJ3. WHhirv the ^^SS 1 !, date un?(VOB) com- 
MA1 . MA2, and MA3 are the title editing unite ue JJ^J^lSpSl angle switching unit. Note that each 
prises plural (three in this example) interleave ^*^^f ™2 anS data MAI thus contains angle switching 
interleaved unit ILVU thus contains a particular number of GOP. The tirst angie aaw 

units A51 . A52. and A53. . data MA1 have presentation times of 1 . 2. and 3 sec- 
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MA2 similarly comprises angle switching units B51 , B52, and B53 with presentation times of 2, 3. and 1 second, respec- 
tively, and the third angle data MA3 comprises angle switching units C51, C52, and C53 with presentation times of 3, 
1 , and 2 seconds, respectively. It should be noted that while the angle data MA1 , MA2. and MA3 each has a 6 sec. pres- 
entation time, the presentation time can obviously be set to various other particular values. 

5 The following example describes switching to and beginning the presentation of another (next) angle view while 

reproducing any particular interleave unit. For example, if the angle is changed from first angle data MA1 to second 
angle data MA2 while reproducing interleave unit A51 , reproduction of interleave unit A51 stops and reproduction of the 
second interleave unit B52 in the selected second angle data MA2 starts. This change intermits both the audio and 
video presentation, resulting in non-seamless information presentation. 

w Likewise, if the user chooses to switch to the scene corresponding to the third angle data MA3 during reproduction 
of the second interleave unit B52 in the second angle data MA2, reproduction of interleave unit B52 stops and repro- 
duction of interleave unit C53 starts. This operation again intermits both the audio and video presentation, resulting in 
non-seamless information presentation. 

Note that in the above cases switching between multiple angle data during data reproduction stops reproduction of 

75 the angle being presented, and seamless information presentation, i.e., presentation without intermitting the audio and 
video, does not occur. 

The method of switching the angle after completing interleave unit presentation is described below. 
If the user selects second angle data MA2 during reproduction of interleave unit A51 in first angle data MA1, for 
example, the second interleave unit B52 in the second angle data MA2 is selected from the point reproduction of inter- 
20 leave unit A51 with a presentation time of 1 sec. is completed. Because the start of interleave unit B52 is 2 sec. from 
the beginning of the multi-angle scene period, there is a gap between the end of interleave unit A51 reproduction at 1 
sec. into the multi-angle scene period and the beginning of interleave unit B52 at 2 sec. into the multi-angle scene 
period, and there is therefore no time-base contiguity in reproduction. More specifically, because there is no continuity 
in the audio, the audio cannot be seamlessly reproduced. 
25 A similar scenario results if the user then switches to the angle scene corresponding to the third angle data MA3 
during reproduction of the second interleave unit B52 in the selected second angle data MA2, i.e., interleave unit C53 
is switched to after reproduction of interleave unit B52 is completed. In this case the completion of interleave unit B52 
reproduction is at 5 sec. from the beginning of the multi-angle scene period, the beginning of interleave unit C53 is at 4 
sec. from the beginning of the multi-angle scene period, and there is again no time-base contiguity. The connection 
30 between the audio and video components of the interleave units B52 and C53 is also not good. 

It is therefore necessary for the interleave units of each angle to have the same reproduction time and contain the 
same number of video frames in order to achieve seamless information switching between multiple angles. 

The internal structure of the interleave units is shown in further detail in Fig. 77. As shown in Fig. 77. each of the 
interleave units ILVUbl , ILVUb2. ILVUd , and ILVUc2 contains interleaved audio and video packets, indicated as A and 
35 V, respectively. 

In general, the data size and presentation time of each audio packet A is the same. In this example, interleave units 
ILVUbl , ILVUb2, ILVUd , and ILVUc2 respectively contain 3,2.2. and 3 audio packets. Therefore, angle data MAB and 
MAC each contain 5 audio packets and 13 video packets in multi-angle scene period T2. 

Angle switching control within a multi-angle scene period comprising a multi-angle system stream (VOB) with this 
40 packet structure is described below. 

If the angle is switched from interleaved unit ILVUbl to interleaved unit ILVUc2, for example, the total number of 
audio packets in these two interleave units ILVUbl and ILVUc2 is 6, which is one more than the specified total number 
of 5 audio packets in this multi-angle scene period T2. This would therefore result in an overlap of one audio packet in 
the reproduced audio if these two ILVU were connected and reproduced. Likewise, if the angle is changed between the 
45 two interleaved units ILVUd and ILVUb2 containing only two audio packets each, the total of 4 audio packets is one 
less than the specified total number of 5 audio packets in this multi-angle scene period T2. Connecting and reproducing 
these two interleaved units would therefore result in an audio packet shortage, and a break in the audio equal to the 
duration of one audio packet. It is therefore clear that when the number of audio packets in the connected interleaved 
units is not equal to the specified number of audio packets in that multi-angle scene period, the audio will not be satis- 
50 factorily connected, and non-seamless information presentation containing noise and/or intermittent audio will result. 

Fig. 45 shows multi-angle scene control when the multi-angle data MAB and MAC shown in Fig. 77 contain different 
audio data. Common angle data BA1 and BA3 contain the common audio data of the common scenes before and after 
the multi-angle scene period. The first angle data MAB comprises first angle interleave unit audio data ILVUbl and 
ILVUb2, the smallest angle switching unit within the multi-angle scene period. The second angle data MAC likewise 
55 comprises second angle interleave unit audio data ILVUd and ILVUc2. 

Fig. 1 4 shows the audio waves resulting from the audio data recorded to the multi-angle data MAB and MAC within 
multi-angle scene period T2. The contiguous audio output of angle data MAB is formed from the two interleaved unit 
audio data ILVUbl and ILVUb2. The contiguous audio output of angle data MAC is likewise formed from the two irrter- 
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leaved unit audio data ILVUd and ILVUc2. 

What happens when reproduction of angle data MAC is switched to during reproduction of the first interleave unit 
audio data ILVUbl in angle data MAB is described below. In this case reproduction of interleaved unit ILVUc2 occurs 
after reproduction of interleaved unit ILVUbl is completed. As a result, the audio wave resulting from these two inter- 

5 leaves units is a mixed wave as shown by audio wave MAB-C in Fig. 12. Note that the resulting audio output wave is 
non-contiguous at the angle switching point, and the audio connection is therefore not completely satisfactory. 

If the audio data is encoded using Dolby AC-3 coding, the problems are even greater because AC-3 coding use 
time-base correlations. Specifically, because the audio is coded using time-base signal correlations, reproduction 
becomes impossible at the angle switching point when the user attempts to stop the audio data from one angle and 

w switch to the audio data of another angle during reproduction in a multi-angle scene period. 

Thus, when each angle in a multi-angle scene period contains discrete audio data, the audio output may be inter- 
mitted between the connected data units when the angle is changed. Depending on the content of the data in such 
cases, noise and/or audio interruptions may occur during reproduction, resulting even in irritation or discomfort to the 
user. Such discomfort is a product of the non-contiguity in the reproduced information content, and can therefore be 

is avoided by assuring information contiguity or preventing information intermittence. Seamless information presentation 
can thus be achieved. 

Multi-angle scene control according to the present invention is described next with reference to Fig. 46. Three angle 
data MA1 , MA2, and MA3 are written to multi-angle scene period T2 in this example. 

The first angle data MA1 thus contains angle switching units A11, A12, and A13 as the smallest possible angle 
20 switching units. These angle switching units A51 , A52, and A53 have presentation times of 2. 1 , and 3 seconds, respec- 
tively. 

The second angle data MA2 similarly comprises angle switching units B1 1 , B12, and B1 3, and the third angle data 
MA3 comprises angle switching units C11, C12, and C13. Each of these angle switching units also respectively have 
presentation times of 2, 1 , and 3 seconds. 
25 Because the synchronized interleave units thus have the same presentation times, contiguous audio and video 
reproduction can be achieved without interruptions or overlaps at the angle switching position even when the user 
switches from one angle to another angle, and seamless information presentation can therefore be achieved as previ- 
ously described. 

To achieve the data structure shown in Fig. 46, i.e., to set the actual presentation times of the smallest angle switch- 

30 ing units of the audio and video material in the mufti-angle scene period to the same times, the number of reproduction 
frames in each interleave unit must be the same. MPEG data compression normally operates on the GOP unit level, 
and two parameters defining the GOP structure are the M and N values. M is the I- or P-picture cycle, and N is the 
number of frames contained in that GOP Furthermore, changing the M or N parameter setting frequently during the 
encoding process simply complicates MPEG video encoding control, and is not, therefore, normally done. 

35 A method of achieving the data structure shown in Fig. 46, i.e., a method of setting the actual presentation times of 
the smallest angle switching units of the audio and video material in the multi-angle scene period to the same times, is 
described below with reference to Fig. 78. 

As the example shown in Fig. 78 writes two angle data units MAB and MAC to the multi-angle scene period, and 
each angle data unit comprises two interleaved units ILVUbl and ILVUb2, and ILVUd and ILVUc2, respectively. This is 

to for simplicity of discussion only. Each interleaved unit is a GOP structure. The GOP structures in the multi-angle scene 
period set the M and N parameters of the synchronized interleaved units ILVUbl and ILVUd to the same values, and 
similarly set the M and N parameters of the synchronized interleaved units ILVUb2 and ILVUc2 to the same values. By 
thus using the same GOP structure parameters in both angle data MAB and MAC, the presentation times of the AV data 
in different angles can be set to the same on the smallest angle switching unit level. Thus, when the angle is changed 

ts from first angle data MAB ILVUb2 to angle data MAC ILVUc2, contiguous video presentation without video intermittence 
or overlap at the angle switching position can be achieved because the angle switching timing between these two ILVU 
is the same. 

A method for actually setting the audio data presentation times on the smallest angle switching unit level to the 
same value in different angles is described next with reference to Fig. 79. As in Fig. 77, the interleave units ILVUbl, 
so ILVUb2, ILVUd , and ILVUc2 contain interleaved audio and video packets, indicated as A and V, respectively. 

The audio packet A data size and presentation times are normally constant. As also shown in the figure, ILVUbl 
and ILVUd in the multi-angle scene period have the same number of audio packets (2). and ILVUb2 and ILVUc2 have 
the same number of audio packets (3). By thus writing the same number of audio packets to the synchronized inter- 
leaved units ILVU of different angle data MAB and MAC, the audio data presentation times can be set the same on the 
is smallest angle switching unit level in different angles. Thus, when the view is changed between angle data MAB and 
MAC, contiguous audio presentation without audio intermittence or overlap at the angle switching position can be 
achieved because the angle switching timing does not change. 

However, if each of the smallest angle switching units in the multi-angle scene period contain audio data describing 
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discrete audio waves, it may not be possible (as described above with reference to Fig. 12) to achieve contiguous audio 
data reproduction at the angle switching point by simply coding each smallest angle switching unit (ILVU) with the same 
audio data presentation time. It is possible to avoid intermitting the audio presentation, however, by writing the : same 
(common) audio data to each angle within a multi-angle scene period on the smallest angle switching unit (ILVU) level. 
As previously described, seamless information presentation can be achieved by arraying the data to assure contiguous 
information content before and after the connection points in the reproduced data, or by formatting the data in closed 
data units that are completely reproduced at the switching points. 

A further multi-angle scene period data structure in which common audio data is written to each different angle is 
shown in Fig. 80. Unlike the structure shown in Fig. 45, this data structure is achieved by writing audio data that is com- 
Dlete within each switching unit (interleaved unit ILVU) to the angle data MAB and MAC. 

Because the audio data is thus complete within each interleaved unit ILVU, audio data producing a discomforting 
audio wave as a result of connecting different audio waves at the angle switching point will not resuft even when the 
angle is changed by switching from first angle interleaved unit ILVUbl to second angle interleaved unit ILVUc2 within 
an encoded audio data multi-angle scene period thus structured. 

Note that if the audio data is formed with the same audio wave on the interleaved unit ILVU level, seamless infor- 
mation presentation can obviously be achieved just as when the audio wave data is complete within each interleaved 

Unit When the audio data is coded using Dolby AC-3 audio coding, the time-base correlations can be retained even 
when the angle is changed, and contiguous audio reproduction can be achieved at the angle switching point wrthout 
noise or intermittence. because common audio data is written to the smallest angle switching units (ILVU) of the angle 
data or is complete within each interleaved unit ILVU. 

It should be noted that the present invention shall not be limited to only two or three types of angle data MA within 
the multi-angle scene period. The multi-angle scene period T2 shall also not be limited to VOB units, and may extend 
for the duration of the title stream. 

It is therefore possible to achieve seamless information presentation as previously defined. 

The operation of multi-angle control based on the digital video disk data structure is thus as described above. 

A method of recording to the recording medium mufti-angle control data specifically enabling selection of a different 
angle data unit while reproducing one angle data unft in the same multi-angle scene period .s. however, described 

The common angle data BA1 shown in Fig. 23 is arrayed to a contiguous block region, the interleave unit data of 
the mufti-angle data MA1 . MA2, and MA3 in the multi-angle scene period is arrayed to an interleaved block region and 
the common angle data BA3 following thereafter is arrayed to the next contiguous block region. Referenced to the data 
structure shown in Fig. 1 6. the common angle data BA1 constitutes one cell, the multi-angle data MA1 , MA2. and MA3 
each constitute one cell, the cells corresponding to the multi-angle data MA1 . MA2. and MA3 constitute one cell block 
(the cell block mode CBM of the MAI cell = "cell block start", the CBM of the MA2 cell = "between first and last cells . 
and the CBM of the MA3 cell = "cell block end"), and these cell blocks are angle blocks (cell block type CBT = angle). 
The common angle data BA3 is the cell that connects to that angle block. The cell-cell connections are also set for 
seamless playback (seamless playback flag SPF = "seamless playback"). 

The structure of the stream containing a multi-angle scene period and the basics of the on-d.sk layout in the pre- 
ferred embodiment of the invention are shown in Fig. 47. The multi-angle scene period is a period in which the: stream 
can be freelv changed by the user. In a stream with the structure shown in Fig. 47. it is possible to switch to VOB-C or 
VOB-D during reproduction of VOB-B. It is therefore likewise possible to switch to VOB-B or VOB-D during reproduction 
of VOB-C. and to switch to VOB-C or VOB-B during reproduction of VOB-D. 

As previously described, the unit for changing the angle, i. e.. the angle selection unit, is the smallest interle ave ur» 
obtained from the conditions defined by equations 3 and 4. and is defined as the angle interleave unit A- ILVU. A-ILVU 
management information is also added to the angle interleave unit A-ILVU. The navigation pack NV previously 
described corresponds to this A-ILVU management information. . MW11 , . 

An example in which the last pack address in the current A-ILVU and the address of the next A-ILVU are recorded 
for plural angles is shown as another embodiment of the invention in Fig. 48. While this figure is very similar to Fig 42 
the angle interleave unit A-ILVU comprises two video object units VOBU in this example, and the navigator i pack NV of 
each VOBU contains the ILVU end pack address ILVUJEA indicating the address of the last pack in the ILVU to which 
the VOBU belongs, and the next-ILVU start address for each angle data block #1 - #9 (SML_AGL_C1 DSTA - 
SML AGL C9 DSTA) These addresses are expressed as the number of sectors from the navigation pack NV contain- 
ina the address" information. In fields in which there is no angle data, information indicating that there is no angle data. 
e l 0 is recorded By thus recording the last pack address and the next-ILVU start address for each angle data block 
#1 1 - #9. it is possible to know the address of each next selectable angle and to switch to a different angle scene without 
reading unnecessary angle information. 

For interleaving data in the multi-angle scene period, all angles are interleaved to the same time reference (inter- 
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leave boundary) using the angle interleave unit A-ILVU for the shortest read time. This is to enable the angle to be 
changed as quickly as possible within the performance limits of the DVD player. Interleave unit data is temporarily 
stored to the track buffer, and the data for the angle selected next is then input to the track buffer. However, the next 
angle cannot be reproduced until the data for the previous angle (the angle being reproduced when the angle was 
5 changed) in the track buffer has been consumed. It is therefore necessary to minimize the size of the angle interleave 
unit A-ILVU in order to maximize the speed at which the next angle scene can be selected and presented. A common 
interleave unit size and boundary position must therefore be used between the VOB constituting each angle. 

This means that the reproduction time of the encoded video stream forming the VOB must be the same for each 
VOB, and the time at which each interleave unit can be reproduced must be the same, i.e., the interleave unit bounda- 
ry ries must be common, for each interleave unit at the same reproduction time in each angle. More specifically, the VOBs 
constituting each angle must be divided into the same number of interleave units, and the reproduction time of said 
interleave units must be the same in each angle, i.e., the VOBs constituting each angle must be divided into the same 
number N of interleave units, and the k-th interleave unit (where 1 < k < N) in each angle must have the same reproduc- 
tion time. 

75 Moreover, to seamlessly reproduce the interleave units between each angle, the encoded stream must be com- 
plete within each interleave units. This means that under the MPEG standard, a closed GOP structure using a compres- 
sion method that does not reference any frames outside the interleave unit must be used. If this method is not used, it 
is not possible to seamless connect and reproduce interleave units from different angles. By using this VOB structure 
and interleave unit alignment, however, contiguous reproduction is possible time-wise even if the angle is changed. 

20 The number of interleave units in the multi-angle scene period is determined by the number of interleave units in 
the other angles that can be arrayed within the jumpable distance after an interleave unit is read. Regarding the 
arrangement of the interleaved units in each angle, the interleave units reproduced first in each angle are arrayed in the 
angle sequence, and are subsequently followed by the interleave units reproduced next in each angle being arrayed in 
the angle sequence. Thus, if the number of angles is M (where M is a natural number, and 1 < M < 9), the m-th angle is 

25 angle #m (where m is a natural number, and 1 < m < M), the number of interleave units is N (where N is a natural 
number greater than or equal to 1), and the n-th interleave unit in a VOB is interleave unit #n (where n is a natural 
number, and 1 < n < N), the interleave unit sequence is angle #1 interleave unit #1 , angle #2 interleave unit #1 , angle 
#3 interleave unit #1 , . . . angle #M interleave unit #1 , angle #1 interleave unit #2, angle #2 interleave unit #2, . . . 

If the interleave unit length of each angle is the shortest read time for seamlessly selected angles where the angle 

30 change is seamless, the maximum distance that must be jumped when moving between angles is the distance from the 
first angle interleave unit in the sequence to the last interleave unit in the interleave unit sequence of the next angle 
reproduced within the sequence of angle interleave units reproduced at the same time. Thus, if the number of angles is 
An, the jump distance must satisfy the following equation 7. 

35 max. angle ILVU length x (An - 1) x 2 < jumpable distance [7] 

With non-seamless multi-angle switching, each angle must be seamlessly reproduced, but seamless reproduction 
is not required when moving between angles. Therefore, if the interleave unit length of each angle is the shortest read 
time, the maximum distance that must be jumped when moving between angles is the distance between the interleave 
40 units of each angle. Thus, if the number of angles is An, the jump distance must satisfy the following equation 8. 

max. angle ILVU length x (An - 1) < jumpable distance [8] 

The method of managing addresses on the switching unit level between multiple angle data VOB in multi-angle 
45 scene periods is described below with reference to Figs. 49 and 50. In Fig. 49 the angle interleave unit A-ILVU is the 
data switching unit, and the address of another angle interleave unit A-ILVU is written to the navigation pack NV of each 
angle interleave unit A-ILVU. Fig. 49 shows the address description achieving seamless reproduction, i.e., uninter- 
rupted reproduction of the audio and video data. This addressing method specifically provides for control whereby only 
the data for the interleave units of the angle to be reproduced is read into the track buffer when the angle is changed. 
so Fig. 50 shows an example in which the video object unit VOBU is the data switching unit, and the address of 
another video object unit VOBU is written to the navigation pack NV of each video object unit VOBU. This addressing 
method provides for non-seamless reproduction control whereby reproduction can be changed as quickly as possible 
to another angle near the time when the angle is changed. 

In Fig. 49, each angle interleave unit A-ILVU in the three multi-angle data VOB-B, VOB-C, and VOB-D records as 
55 the address of the next angle interleave unit A-ILVU to be reproduced the address of a chronologically later A-ILVU. 
Note that VOB-B is designated angle #1 , VOB-C is angle #2, and VOB-D is angle #3. The murti-angle data VOB-B con- 
sists of angle interleave units A-ILVUb1, A-ILVUb2, and A-ILVUb3. multi-angle data VOB-C similarly consists of angle 
interleave units A-ILVUc1 , A-ILVUc2. and A-ILVUc3, and multi-angle data VOB-D of angle interleave units A-ILVUdl , A- 
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interleaved unit ILVU, the listener will not even be able to discern that the audio track has changed if the audio is seam- 
lessly reproduced across angle changes. 

A data structure enabling non-seamless reproduction of angle changes, i.e., seamless data reproduction permitting 
perceptible breaks in the content of the reproduced information, is described below with reference to Fig. 50. 
s It is assumed in Fig. 50 that multi-angle data VOB-B comprises three video object units VOBUbl , VOBUb2, and 

VOBUb3. Multi-angle data VOB-C similarly comprises three video object units VOBUd, VOBUc2, and VOBUc3. and 
VOB-D comprises three video object units VOBUdl, VOBUd2, and VOBUd3. As in the case shown in Fig. 49, the nav- 
igation pack NV of each video object unit VOBU contains the end pack address VOBU_EA of each VOBU. Note that 
this end pack address VOBU_EA is the address of a navigation pack NV in a VOBU comprising a navigation pack NV 
10 and one or more other packs. In this example, however, the NSML_AGL_C#_DSTA address written to the navigation 
pack NV of each VOBU does not describe a chronologically later VOBU address, but the address of a VOBU in a differ- 
ent angle with a reproduction time preceding the time the angle is changed. 

In other words, addresses NSML_AGL_C#1 _DSTA to NSM L_AGL_C#9_D STA to VOBU in other angles synchro- 
nized to the current VOBU are recorded. As above, #1 - #9 represent different angle numbers. For fields in which an 
15 angle of the corresponding number does not exist, a value, e.g., 0, indicating that the angle does not exist is recorded. 
Thus, the navigation pack NV of video object unit VOBUbl of multi-angle data VOB-B contains N SM L_AGL_C#2_D STA 
and NSM L_AGL_C#3__D STA, the relative addresses of the synchronized VOBUd and VOBUdl in VOB-C and VOB-D 
as shown by lines Pb1c' and Pb1d\ 

The navigation pack NV of video object unit VOBUb2 similarly contains as shown by lines Pb2c' and Pb2d' the rel- 
20 ative addresses NSML_AGL_C#2JDSTA and NSML_AGL_C#3_DSTA of the synchronized VOBUc2 and VOBUd2. The 
navigation pack NV of video object unit VOBUb3 similarly contains as shown by lines Pb3c' and Pb3d' the relative 
addresses NSM L_AGL_C#2_D STA and NSML_AGL_C#3_DSTA of the synchronized VOBUc3 and VOBUd3. 

The navigation packs NV of video object units VOBUd , VOBUc2, and VOBUc3 of VOB-C similarly contain the rel- 
ative addresses NSML_AGL_C#1_DSTA and N SM L_AGL_C#3_D STA of the respectively synchronized VOBU as 
25 shown by lines Pc1b\ Pc1d\ Pc2b\ Pc2d\ Pc3b\ and Pc3d'. 

The navigation packs NV of video object units VOBUdl , VOBUd2, and VOBUd3 of VOB-D likewise contain the rel- 
ative addresses NSML_AGL_C#1_DSTA and NSML_AGL_C#2_DSTA of the respectively synchronized VOBU as 
shown by lines Pd1b\ Pd1c\ Pd2b\ Pd2c\ Pd3b\ and Pd3c'. 

The angle selection address information N SM L_AG L_C#4_D STA to N SML_AGL_C#9_D STA for non-existent 
30 angles #4 - #9 in this example record a value indicating that corresponding angles do not exist, e.g., a 0 value. 

During reproduction of angle data with this data structure, the DVD decoder interrupts reproduction of the angle 
VOBU data currently being reproduced when the angle is changed, then reads and begins reproducing the VOBU data 
for the selected angle. 

Note that while there appears to be a time offset between the interleaved units of VOB-C relative to VOB-B and 
35 VOB-D in Fig. 50, this is merely to facilitate easier understanding of the address descriptions written to the respective 
navigation packs NV of each VOB using the line arrows as shown. There is, in fact, no time shifting between the respec- 
tive interleaved video objects VOB, which are aligned as shown in Fig. 49. 

The data structure shown in Fig. 50 thus contains as pointers to the next VOBU to be reproduced addresses to 
another VOBU that should normally be reproduced either simultaneously with or chronologically before the VOBU con- 
40 taining the address information. Therefore, when the angle is changed, reproduction is continued from a scene at a past 
point in time. This method of encoding the address information is thus preferable when seamless angle switching not 
required, i.e., for non-seamless information reproduction in which contiguity is not required in the reproduced informa- 
tion. 

45 Flow chart: encoder 

The encoding information table generated by the encoding system controller 200 from information extracted from 
the scenario data St7 is described below referring to Fig. 27. 

The encoding information table contains VOB set data streams containing plural VOB corresponding to the scene 
so periods beginning and ending at the scene branching and connecting points, and VOB data streams corresponding to 
each scene. These VOB set data streams shown in Fig. 27 are the encoding information tables generated at step #1 00 
in Fig. 51 by the encoding system controller 200 for creating the DVD multimedia stream based on the user-defined title 
content 

The user-defined scenario contains branching points from common scenes to plural scenes, or connection points 
55 to other common scenes. The VOB corresponding to the scene period delimited by these branching and connecting 
points is a VOB set, and the data generated to encode a VOB set is the VOB set data stream. The title number specified 
by the VOB set data stream is the title number TITLEJMO of the VOB set data stream. 

The VOB Set data structure in Fig. 27 shows the data content for encoding one VOB set in the VOB set data 
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stream, and comprises: the VOB set number VOBS_NO, the VOB number VOBJMO in the VOB set. the preceding VOB 
seamless connection flag VOB_Fsb, the following VOB seamless connection flag VOB_Fsf, the multi-scene flag 
VOB_Fp, the interleave flag VOB_Fi. the multi-angle flag VOB_Fm, the multi-angle seamless switching flag VOB_FsV. 
the maximum bit rate of the interleaved VOB ILV_BR. the number-of interleaved VOB divisions ILVJDIV, and the mini- 
5 mum interleaved unit presentation time ILVU_MT. 

The VOB set number VOBS_NO is a sequential number identifying the VOB set and the position of the VOB set in 
the reproduction sequence of the title scenario. 

The VOB number VOB_NO is a sequential number identifying the VOB and the position of the VOB in the repro- 
duction sequence of the title scenario. 
w The preceding VOB seamless connection flag VOB_Fsb indicates whether a seamless connection with the preced- 
ing VOB is required for scenario reproduction. 

The following VOB seamless connection flag VOB_Fsf indicates whether there is a seamless connection with the 
following VOB during scenario reproduction. 

The multi-scene flag VOB_Fp identifies whether the VOB set comprises plural video objects VOB. 
15 The interleave flag VOB_Fi identifies whether the VOB in the VOB set are interleaved. 

The multi-angle flag VOB_Fm identifies whether the VOB set is a multi-angle set. 

The multi-angle seamless switching flag VOB_FsV identifies whether angle changes within the multi-angle scene 
period are seamless or not. 

The maximum bit rate of the interleaved VOB ILV_BR defines the maximum bit rate of the interleaved VOBs. 
20 The number of interleaved VOB divisions ILV_DIV identifies the number of interleave units in the interleaved VOB. 

The minimum interleave unit presentation time ILVUJvIT defines the time that can be reproduced when the bit rate 
of the smallest interleave unit at which a track buffer data underflow state does not occur is the maximum bit rate of the 
interleaved VOB ILV_BR during interleaved block reproduction. 

The encoding information table for each VOB generated by the encoding system controller 200 based on the sce- 
25 nario data St7 is described below referring to Fig. 28. The VOB encoding parameters described below and supplied to 
the video encoder 300, audio encoder 700, and system encoder 900 for stream encoding are produced based on this 
encoding information table. 

The VOB data streams shown in Fig. 28 are the encoding information tables generated at step #100 in Fig. 51 by 
the encoding system controller 200 for creating the DVD multimedia stream based on the user-defined title content. 
30 The encoding unit is the video object VOB. and the data generated to encode each video object VOB is the VOB 
data stream. For example, a VOB set comprising three angle scenes comprises three video objects VOB. The data 
structure shown in Fig. 28 shows the content of the data for encoding one VOB in the VOB data stream. 

The VOB data structure contains the video material start time VOB_VST. the video material end time VOB_VEND, 
the video signal type VOB_V_KlND, the video encoding bit rate V_BR, the audio material start time VOB_AST. the 
35 audio material end time VOB_AEND. the audio coding method VOB_A_KIND, and the audio encoding bit rate A_BR. 

The video material start time VOB_VST is the video encoding start time corresponding to the time of the video sig- 
nal. 

The video material end time VOB_VEND is the video encoding end time corresponding to the time of the video sig- 
nal. 

40 The video material type VOB_V_KIND identifies whether the encoded material is in the NTSC or PAL format, for 
example, or is photographic material (a movie, for example) converted to a television broadcast format (so-called tele- 
cine conversion). 

The video encoding bit rate V_BR is the bit rate at which the video signal is encoded. 

The audio material start time VOB_AST is the audio encoding start time corresponding to the time of the audio sig- 

45 nal. 

The audio material end time VOB_AEND is the audio encoding end time corresponding to the time of the audio sig- 
nal. 

The audio coding method VOB_A_KIND identifies the audio encoding method as AC-3, MPEG, or linear PCM, for 
example. 

50 The audio encoding bit rate A_BR is the bit rate at which the audio signal is encoded. 

The encoding parameters used by the video encoder 300, sub-picture encoder 500, and audio encoder 700, and 
system encoder 900 for VOB encoding are shown in Fig. 29. The encoding parameters include: the VOB number 
VOB_NO, video encode start time V_STTM, video encode end time V_ENDTM, the video encode mode V_ENCMD. 
the video encode bit rate V_RATE, the maximum video encode bit rate V_MRATE, the GOP structure fixing flag 

55 GOP_Fxflag, the video encode GOP structure GOPST, the initial video encode data VJNTST, the last video encode 
data V_ENDST, the audio encode start time A_STTlv1, the audio encode end time A_ENDTM, the audio encode bit rate 
AERATE, the audio encode method A_ENCMD, the audio start gap A_STGAP, the audio end gap A_ENDGAP, the pre- 
ceding VOB number B_VOB_NO, and the following VOB number F_VOB_NO. 
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scene is a multi-scene period, a seamless connection still refers to seamlessly connecting the target scene with any 
one of the scenes from the same multi-scene period. 

If step #300 returns NO, i.e., a non-seamless connection is valid, the procedure moves to step #400. 

At step #400, the encoding system controller 200 resets the preceding VOB seamless connection flag VOB_Fsb 
5 indicating whether there is a seamless connection between the target and preceding scenes. The procedure then 
moves to step #600. 

On the other hand, if step #300 returns YES, i.e.. there is a seamless connection to the preceding scene, the pro- 
cedure moves to step #500. 

At step #500 the encoding system controller 200 sets the preceding VOB seamless connection flag VOB_Fsb. The 
w procedure then moves to step #600. 

At step #600 the encoding system controller 200 determines whether there is a seamless connection between the 
target and following scenes based on scenario data St7. If step #600 returns NO, i.e., a non-seamless connection is 
valid, the procedure moves to step #700. 

At step #700, the encoding system controller 200 resets the following VOB seamless connection flag VOB_Fsf indi- 
15 eating whether there is a seamless connection with the following scene. The procedure then moves to step #900. 

However, if step #600 returns YES, i.e., there is a seamless connection to the following scene, the procedure 
moves to step #800. 

At step #800 the encoding system controller 200 sets the following VOB seamless connection flag VOB_Fsf. The 
procedure then moves to step #900. 
20 At step #900 the encoding system controller 200 determines whether there is more than connection target scene, 
i.e., whether a multi-scene period is selected, based on the scenario data St7. As previously described, there are two 
possible control methods in multi-scene periods: parental lock control whereby only one of plural possible reproduction 
paths that can be constructed from the scenes in the multi-scene period is reproduced, and multi-angle control whereby 
the reproduction path can be switched within the multi-scene period to present different viewing angles. 
25 If step #900 returns NO, i.e., there are not multiple scenes, the procedure moves to step #1000. 

At step #1000 the multi-scene flag VOBJ=p identifying whether the VOB set comprises plural video objects VOB (a 
multi-scene period is selected) is reset, and the procedure moves to step #1 800 for encode parameter production. This 
encode parameter production subroutine is described below. 

However, if step #900 returns YES, there is a multi-scene connection, the procedure moves to step #1 100. 
30 At step #1 100, the multi-scene flag VOB_Fp is set, and the procedure moves to step #1200 whereat it is judged 
whether a multi-angle connection is selected, or not 

At step #1200 it is determined whether a change is made between plural scenes in the multi-scene period, i.e., 
whether a multi-angle scene period is selected. If step #1200 returns NO, i.e.. no scene change is allowed in the multi- 
scene period as parental lock control reproducing only one reproduction path has been selected, the procedure moves 
35 to step #1300. 

At step #1300 the multi-angle flag VOB_Fm identifying whether the target connection scene is a multi-angle scene 
is reset, and the procedure moves to step #1 302. 

At step #1302 it is determined whether either the preceding VOB seamless connection flag VOB_Fsb or following 
VOB seamless connection flag VOB_Fsf is set. If step #1302 returns YES, i.e., the target connection scene seamlessly 
40 connects to the preceding, the following, or both the preceding and following scenes, the procedure moves to step 
#1304. 

At step #1304 the interleave flag VOB_Fi identifying whether the VOB, the encoded data of the target scene, is 
interleaved is set. The procedure then moves to step #1800. 

However, if step #1302 returns NO, i.e., the target connection scene does not seamlessly connect to the preceding 
45 or following scene, the procedure moves to step #1 306. 

At step #1306 the interleave flag VOB_Fi is reset, and the procedure moves to step #1800. 

If step #1200 returns YES, however, i. e., there is a multi-angle connection, the procedure moves to step #1400. 

At step #1400, the multi-angle flag VOB_Fm and interleave flag VOB_Fi are set. and the procedure moves to step 
#1500. 

so At step #1500 the encoding system controller 200 determines whether the audio and video can be seamlessly 
switched in a multi-angle scene period, i.e., at a reproduction unit smaller than the VOB, based on the scenario data 
St7. If step #1500 returns NO, i.e.. non-seamless switching occurs, the procedure moves to step #1600. 

At step #1600 the multi-angle seamless switching flag VOB_FsV indicating whether angle changes within the multi- 
angle scene period are seamless or not is reset, and the procedure moves to step #1800. 
55 However, if step #1500 returns YES, i.e., seamless switching occurs, the procedure moves to step #1700. 

At step #1700 the multi-angle seamless switching flag VOB_FsV is set and the procedure moves to step #1800. 
Therefore, as shown by the flow chart in Fig. 51 , encode parameter production (step #1 800) is only begun after the 
editing information is detected from the above flag settings in the scenario data St7 reflecting the user-defined editing 
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inS "S^on t» editing insaucfions OMcM .«« «» JJ^iSSSfSJSSSS 

M ' £5 on fhe encode parameters produced in step ..800. the video data and audio date are encoded in sup 

»fflh the preceding and following scenes * thereof. ™' ^ ^ an ^SoTme-bese. sub-picture data is 
video information for one frame and ^"^^fs^TS^S^ •*«*■•» <° sea "" 

£K£^S5322^^ - - * - 

^ C££3SL sfep in a fcop ~rrp.is.ng sfsps «00 to sfep -J^^^^SSE! 
man, ernes as mere are VOB S«a TNs loop! torn.* *. *T ^^^^am^ura. 

cedurethen moves to step #2200 J. s (VTST TVOBS) (Fig. 16) is 

and the encode parameters in Fig. 29. The procedure ^then moves to *«P«^a ^- SpGC information table 

57, 58, 59, and 60. . #1 « nn in Ra 51 r j S described next using Figs. 52, 

^^^^^^^^^^^ 

shown in Fig. 29. . „ n s_ in(3(i in » he scenario data St7 is extracted, the VOB 

At steD #1812 the scenario reproduction sequence (path) contained in tne ^ enB,l "r* 

Se, numTvOBslNO is set. a* me VOB ^W^-* « and me 

m^ir^eo^rss 

encode parameters shown in Fig 29. material end time VOB_VEND are extracted for 

each^V^ 

^ 'SSSSm the audio materia, start time VOB.AST of each VOB is extracted from the scenario data 817. and the 
audio encode start time A STTM is ^« <£* SSSJSor each VOB from the scenario data St7. and 

At step VS2S the audio JJ^jJEl ^islme e^aS at In audio access un* (AAU) is set as the audio 

encTe^^^^ 
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" ^ « aS^Sf-l W A STGAP obtained .rom .he d«.«*nce baween the »(deo encode star. time 

rale A.RATE is set as an au«o ancoctag ^i m « e '; ^ „ om „, e data St7. It the material is a film 

At step «1840 the initial wdeo encode ^JfTTST s* me . «~ parameter. 
VBVI^er* value set W ^ sa ^ ^ preced|ng ^ nunl3er 

B^B^o^e^ 

encode parameter. following connection is set to the following VOB number 

^rX information table and encode parameters are thus generated tor a multi-angle VOB Set with non- 
seamless muHi-angle switching control enabled^ seamless switching stream with multi-angle control is 
25 The process for generating the encode parameters of a se^iessswr^ g ^ ^ 
" described^ with reference to Fig 5^ This .stream .generated when «P#«0 « VOB_Fm = 1. and VOB_FsV 
TK S^r^^ l^^X^ in Pi, ,7 and Pig. * and »e encode 
30 VmK £^ } £&v^ »• -coding information tables shown in Fig. 27 and Fig. 28. and the encode 

parameters shown in Fig. 29. . . ^ Ariiainof i ; n the scenario data St7 is extracted, the VOB 

P At step #1850, the scenario reproducer s^u-K. fed* J^JS^,*' in the VOB Set. 

Set number VOBS.NO is set. and the VOB nunrWrVOB ^j^ 5 ^ J^' aae<) , romtnesceTOrtoaaB s,7.and m e 

^^^^^^^ 

At steo #1 856 the video encode GOP structure GOPb i vaiues in i ^ a 

4t with multi-angle control as described above. , parental lock control is imple- 

Tlre process to. generating the encode parameters tor ^ W»™ J£ 51 Iewrns no 

menWlsdesedbadt»lo»wilhre1er^to^^^»«»^> e ^^^ = , „ VOB.Fat- 1. VOB.Fp. t. 

" "tS^irenito^cSa^ 

Set numbe, VOBS.NO is set. and I the ; VOB JJuS tram the scanariodata St7. and «ta 

a^Set^ 
55 ting (= 1 ). . . . . wnR riivisions ILV D IV is extracted from the scenario data St7. 

S— SKK Sr^SSSS- ! ^r^* * as descrtoed re,e*ng to ate .tow chad in F,g. m. 
Further description thereof is thus omitted here. 
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The encode parameters of a system stream in which parental lock control is implemented are thus defined for a 
VOB Set with multi-scene selection control enabled as described above. 

The process for generating the encode parameters for a system stream containing a single scene '^described 
below with reference to Fig. 61 This stream is generated when step #900 in Fig. 51 returns NO. ..e.. when VOB_Fp=0. 
Trowing , operation produces the encoding information tab.es shown in Fig. 27 and Fig. 28. and the encode param- 

eter Ar s r p n #S £ scenario reproduction sequence (path) contained in the scenario data St7 is extracted, the VOB 
Set number VOBS NO is set. and the VOB number VOB.NO is set for one or more VOBin the VOB Set. 

#18827he maximum bit rate ILV_BR of the interleaved VOB is extractedfrom *e scenano MArtta 
maximum video encode bit rate VJvlRATE from the encode parameters is set based on the interleave flag VOB_F, set- 

tin9 Stop #1884 is the common VOB data setting routine, which is as described referring to the flow chart in Fig. 52. 

Further description thereof is thus omitted here. 

^esefkw charts for defining the encoding information table and encode parameters thus generate the parameters 
for DVD video, audio, and system stream encoding by the DVD formatter. 

Formatter flows 

The operation of the subroutine executed by the DVD formatter shown as step #2300 in Fig. 51 B is described next 
with!e?ere?ce o fSb 56 57. 58. 59. and 60. This formatter subroutine generates the DVD multimedia brtstream 

S^dWW encoder ECD 1 100 according to the present invention is described with reference to the 
flow chart in Fig. 56. Note that those steps shown in Fig. 56 with a double line are subroutines 

Aistep «3lb the program chain information VTS.PGCI is set to the VTSI management table VTSI_MAT for the 
nun*er of tiUes TITLE NUM based on the number of titles TITLE.NUM in the VOB Set data stream. 

T^Ip #^12 « is determined whether multi-scene selection control is enabled based on themum-scene flag 
VOB Fp fn the VOB Se, da ta stream. If step #2312 returns NO. i.e.. multi-scene control is not enabled, the procedure 

m ° V S 'S£i£ui» operation for coding a single scene (VOB) executed by the formatter 1 100 of the authoring 
encoder EC shown in Fig. 25 is accomplished. This routine is described later •*£ 

If steo #2312 returns YES, i.e.. multi-scene control is enabled, the procedure moves to step #21 16. 

At stop^SieTis determined whether the information is to be interleaved or not based on M Mm Mo 
VOB FfSteTn the VOB Set data stream. If step #2316 returns NO. i.e.. the information is not to be interleaved^ the 
pr^rfmUerto X #2314. If step #2316 returns YES, i.e.. the information is to be interleaved, the procedure 

™Tx siT^sl is determined whether multi-angle control is to be implemented based on the multi-angle ffag 
VOB Fmln the VOB Set data stream. If step #231 8 returns NO, the parental lock control routine .n step #2320 .s exe- 
nitPd If steD #2318 returns YES, the procedure moves to step #2322. 

At step #^20 tne operation for formatting the VOB Set for parental lock control is executed. Th,s subroutine .s 

ShOV £ Z g ^^^^ ™*-*' e seamleSS iS b S2 h ° n mU Tn 9 dn e sSm 

less switching flag VOB.FsV If multi-angle switching is accomplished without seamless swrtch.ng. ..e.. w,th non-seam- 
less switching and step #2322 returns NO, the procedure moves to step #2326. 

S multi-angle n^-seamless switching control routine executed in step #2326 by the formatter 1 100 of the author- 
ing encoder EC in Fig. 25 is described later with reference to Fig. 57. 

If multi-angle switching is accomplished with seamless switch.ng control. ..e.. step #2322 returns YES. the proce 

dUre TrmulLTg^am.ess switching contro. routine executed in step #2324 by the formatter 1 100 of the authoring 
encoder EC in Fia 25 is described later with reference to Fig. 58. 
^e cell playWclTnformation (PCG information entries C_PB.) of the VTS information VTSI set as prevously 

^A^S^ermined whether all VOB Sets declared by the VOB Set number VOBS , . have been 
processed the formatter. If NO. contro. loops back to step #2312. and the process runs agam. If YES. all sets have 

^efe^ aching control routine executed in step #2326 when step 

#2322 R 56 ^e^urns NO is described. This routine defines the interleaved arrangement of the ™rt,med,a brtstream 

T'JSiS^SZ ce.l playback information (C_PB.#i) shown in F,g. 1 6 and the informal stored to the navga- 
tinn nack NV shown in Fig. 20. in the generated DVD multimedia brtstream MBS. 

At2^3^s«J on trie multi angle flag VOB_Fm setting (= 1) declaring whether multi-angle control is applied 
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in the multi-scene period, the cell block mode CBM (Fig. 16) of the cell playback information blocks C_PBI #i containing 
the VOB control information for each scene is declared according to the position of the angle data. For example, the cell 
block mode CBM of the MA1 cell (Fig. 23) is declared as 01 b to indicate the beginning of the cell block, the CBM of M A2 
is declared as 10b to indicate a cell between the first and last cells in the block, and the CBM of MA3 is declared as 1 1b 
5 to indicate the end of the cell block. 

At step #2342 based on the multi-angle flag VOB_Fm setting (= 1 ) declaring whether multi-angle control is applied 
in the multi-scene period, the cell block type CBT (Fig. 16) of the cell playback information blocks C_PBI #i containing 
the VOB control information for each scene is declared as 01b to indicate an "angle." 

At step #2344 the seamless playback flag SPF (Fig. 16) is set to 1 in the cell playback information blocks C_PBI #i 
10 containing the VOB control information for each scene based on the preceding VOB seamless connection flag 
VOB_Fsb state, which. is set to 1 to indicate a seamless connection. 

At step #2346 the STC resetting flag STCDF is set to 1 in the cell playback information blocks C_PBI #i containing 
the VOB control information for each scene based on the preceding VOB seamless connection flag VOB_Fsb state, 
which is set to 1 to indicate a seamless connection. 
15 At step #2348 the interleaved allocation flag IAF (Fig. 16) is set to 1 in the cell playback information blocks C_PBI 
#i containing the VOB control information for each scene based on the multi-angle seamless switching flag VOB_FsV 
state, which is set to 1 to indicate interleaving is required. 

At step #2350 the location of the navigation pack NV (relative sector number from the VOB beginning) is detected 
from the title editing unit (VOB below) obtained from the system encoder 900 in Fig. 25. the navigation pack NV is 
20 detected based on the minimum interleaved unit presentation time ILVU_MT information (a formatter parameter 
obtained in step #1 81 6, Fig. 51), the location of the VOBU expressed as the number of sectors from the VOB beginning, 
for example, is thus obtained, and the title editing unit VOB is divided into interleave units using VOBU units. 

For example, if in this example the minimum interleaved unit presentation time ILVU_MT is 2 sec and the presen- 
tation time of one VOBU is 0.5 sec, then the VOB is divided into interleave units of 4 VOBU each. Note that this alloca- 
25 tion operation is applied to the VOB constituting each multi-scene data unit 

At step #2352 the interleave units of each VOB obtained from step #2350 are arranged in the cell block mode CBM 
sequence (cell block beginning, middle, and end cells) written as the VOB control information for each scene in step 
#2340 to form the interleaved blocks as shown in Fig. 37 or 38. The interleaved blocks are then added to the VTS title 
VOBS (VTSTT_VOBS). Using the cell block mode CBM declarations above, for example, the angle data MA1 , MA2, 
30 and MA3 (Fig. 23) are arranged in that sequence. 

At step #2354 the relative sector number from the VOBU start is written to the VOB end pack address VOBU_EA 
(Fig. 20) in the navigation pack NV of each VOBU based on the VOBU position information obtained in step #2350. 

At step #2356 the first cell VOBU start address C_FVOBU_SA and the last cell VOBU start address C_LVOBU_SA 
expressed as the number of sectors from the beginning of the VTS title VOBS (VTSTT_VOBS) are written as the 
35 addresses of the navigation packs NV of the first and last VOBU in each cell based on the VTS title VOBS 
(VTSTT_VOBS) data obtained in step #2352. 

The angle #i VOBU start address N SM L_AG L_C 1 _DSTA - NSML_AGL_C9_DSTA of the non-seamless angle 
information NSML_AGLI (Fig. 20) in the navigation pack NV of each VOBU is written at step #2358. This address is 
expressed as the relative sector number inside the data of the interleaved blocks formed in step #2352. and declares 
40 the address information (Fig. 50) of the navigation pack NV contained in the VOBU of all angle scenes near the pres- 
entation start time of the VOBU being processed. 

At step #2360 "7FFFFFFFh" is written to the angle #i VOBU start address NSML_AGL_C1_DSTA - 
N SM L_AGL_C9_D STA of the non-seamless angle information NSML_AGLI (Fig. 20) in the navigation pack NV of each 
VOBU if the VOBU being processed is the last VOBU of each scene in the multi-scene period. 
45 This routine thus formats the interleaved blocks for multi-angle non-seamless switching control in the multi-scene 
period, and formats the cell control information as the reproduction control information for those multiple scenes. 

Referring to Fig. 58. the multi-angle seamless switching control routine executed in step #2324 when step #2322, 
Fig. 56, returns YES is described. This routine defines the interleaved arrangement of the multimedia bitstream MBS. 
the content of the cell playback information (C_PBI#Q shown in Fig. 16, and the information stored to the navigation 
50 pack NV shown in Fig. 20, in the generated DVD multimedia bitstream MBS. 

At step #2370 based on the multi-angle flag VOB_Fm setting (= 1) declaring whether multi-angle control is applied 
in the multi-scene period, the cell block mode CBM (Fig. 1 6) of the cell playback information blocks C_PBI #i containing 
the VOB control information for each scene is declared according to the position of the angle data. For example, the cell 
block mode CBM of the MA1 cell (Fig. 23) is declared as 01 b to indicate the beginning of the cell block, the CBM of MA2 
55 is declared as 10b to indicate a cell between the first and last cells in the block, and the CBM of MA3 is declared as 1 1b 
to indicate the end of the cell block. 

At step #2372 based on the multi-angle flag VOB_Fm setting (= 1) declaring whether multi-angle control is applied 
in the multi-scene period, the cell block type CBT (Fig. 16) of the cell playback information blocks C_PBI #i containing 
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the VOB control information for each scene is declared as 01b to indicate an "angle." 

At step #2374 the seamless playback flag SPF (Fig. 16) is set to 1 in the cell playback information blocks C_PBI #i 
containing the VOB control information for each scene based on the preceding VOB seamless connection flag 
VOB_Fsb state, which is set to 1 to indicate a seamless connection. 
5 At step #2376 the STC resetting flag STCDF is set to 1 in the cell playback information blocks C_PBI #i containing 

the VOB control information for each scene based on the preceding VOB seamless connection flag VOB_Fsb state, 
which is set to 1 to indicate a seamless connection. 

At step #2378 the interleaved allocation flag IAF (Fig. 16) is set to 1 in the cell playback information blocks C_PBI 
#i containing the VOB control information for each scene based on the multi-angle seamless switching flag VOB_FsV 
io state, which is set to 1 to indicate interleaving is required. 

At step #2380 the location of the navigation pack NV (relative sector number from the VOB beginning) is detected 
from the title editing unit (VOB below) obtained from the system encoder 900 in Fig. 25, the navigation pack NV is 
detected based on the minimum interleaved unit presentation time ILVU_MT information (a formatter parameter 
obtained in step #1 854, Fig. 53), the location of the VOBU expressed as the number of sectors from the VOB beginning, 
75 for example, is thus obtained, and the title editing unit VOB is divided into interleave units using VOBU units. 

For example, if in this example the minimum interleaved unit presentation time ILVUJvIT is 2 sec and the presen- 
tation time of one VOBU is 0.5 sec., then the VOB is divided into interleave units of 4 VOBU each. Note that this alloca- 
tion operation is applied to the VOB constituting each multi-scene data unit. 

At step #2382 the interleave units of each VOB obtained from step #2380 are arranged in the cell block mode CBM 
20 sequence (cell block beginning, middle, and end cells) written as the VOB control information for each scene in step 
#2360 to form the interleaved blocks as shown in Fig. 37 or 38. The interleaved blocks are then added to the VTS title 
VOBS (VTSTT_VOBS). Using the cell block mode CBM declarations above, for example, the angle data MA1, MA2, 
and MA3 (Fig. 23) are arranged in that sequence. 

At step #2384 the relative sector number from the VOBU start is written to the VOB end pack address VOBU_EA 
25 (Fig. 20) in the navigation pack NV of each VOBU based on the VOBU position information obtained in step #2360. 

At step #2386 the first cell VOBU start address C_FVOBU_SA and the last cell VOBU start address CJ_VOBU_SA 
expressed as the number of sectors from the beginning of the VTS title VOBS (VTSTT_VOBS) are written as the 
addresses of the navigation packs NV of the first and last VOBU in each cell based on the VTS title VOBS 
(VTSTT_VOBS) data obtained in step #2382. 
30 At step #2388 the relative sector number from the VOBU start is written to the VOB end pack address VOBU_EA 
(Fig. 20) in the navigation pack NV of each VOBU based on the interleave unit data obtained in step #2370. 

The angle #i VOBU start address SML_AGL_C1_DSTA - SML_AGL_C9_DSTA of the seamless angle information 
SML_AGLI (Fig. 20) in the navigation pack NV of each VOBU is written at step #2390. This address is expressed as the 
relative sector number inside the data of the interleaved blocks formed in step #2382, and declares the address infor- 
35 mation (Fig. 50) of the navigation pack NV contained in the VOBU of all angle scenes with a start time contiguous to 
the reproduction end time of the VOBU being processed. 

At step #2392 "7FFFFFFFh" is written to the angle #i VOBU start address SML_AGL_C1_DSTA - 
SML_AGL_C9_DSTA of the seamless angle information SML_AGLI (Fig. 20) in the navigation pack NV of the VOBU 
contained in the interleaved unit if the interleave unit arranged in step #2382 is the last interleave unit of each scene in 
40 the multi-scene period. 

This routine thus formats the interleaved blocks for multi-angle seamless switching control in the multi-scene 
period, and formats the cell control information as the reproduction control information for those multiple scenes. 

The parental lock subroutine (step #2320, Fig. 56) executed when step #2318 in Fig. 56 returns NO, i.e., when it is 
determined that parental lock control is implemented and not multi-angle control, is described next with reference to Fig. 
45 59. 

The parental lock subroutine described below writes the interleave unit arrangement of the multimedia bitstream, 
the content of the PGC information entries C_PBI #i (cell playback information) shown in Fig. 16, and the navigation 
pack NV information shown in Fig. 20, to the generated DVD multimedia bitstream. 

At step #2402 a value "00b" is written to the cell block mode CBM (Fig. 16) of the cell playback information blocks 
so C_PBI #i containing the VOB control information for each scene based on the multi-angle flag VOB_Fm state, which is 
set to 0 to indicate that multi-angle control is not enabled in the multi-scene period. 

At step #2404 the seamless playback flag SPF (Fig. 16) is set to 1 in the cell playback information blocks C_PBI #i 
containing the VOB control information for each scene based on the preceding VOB seamless connection flag 
VOB_Fsb state, which is set to 1 to indicate a seamless connection. 
55 At step #2406 the STC resetting flag STCDF is set to 1 in the cell playback information blocks C_PBI #i containing 
the VOB control information for each scene based on the preceding VOB seamless connection flag VOB_Fsb state, 
which is set to 1 to indicate a seamless connection. 

At step #2408 the interleaved allocation flag IAF (Fig. 16) is set to 1 in the cell playback information blocks C_PBI 
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the VOB control information for each scene based on the preceding VOB seamless connection flag VOB.Fsb state. 
^ returns NO, i.e.. there is not a seam.ess connection to the preceding scene, the procedure moves to 

containing the VOB control information for each scene based on the preceding 

the %£S«wLn***> to, each scene based an the preceding VOB seamless conneahon flag VOB.Fsb slate. 

igation pack NV (Fig. 20), to the produced DVD multimedia brtstream. 
is Decoder flo w charts 

nick-tn-stream bi'*^ transfer flow 

register. The scenario information «M records «*»?™"^J£ I ^J££Lan register extracts aad 
scenario information extracted into the scenario information register sub . reaisters ie the angle number 

^VTSn^^Jeg recces «. number a. ft. next VTS rep^duced earn among ft. pfura, VTS an the 
""Nh. PGC number VTS.PGCI_NO.rag recorfs which at the plura! pragram chains PGC present in the .idea «f. 

C_LVOBU_SA_reg. w^ov.™ «*.~thpr nlural cells constitute one functional block. If there 

" "^ecet^^CBT.regstoresavratuede.^ 

» me c.» tfc* is a ""^^^J^^u iS*JS3^iSSi is seamtess connected with the 

ss is not specified. NSML is stored. identifying whether the cell exists in a cantiguaus at inter- 
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when the cell is reproduced; when resetting the system time clock STC is necessary. STC_RESET is stored; if resetting 
is not necessary. STCJM RESET is stored. 

The seamless angle change flag SACF_reg stores a value indicating whether a cell in a multi-angle period should 
be connected seamlessly at an angle change. If the angle change is seamless, the seamless angle change flag SACF 
5 is set to SML; otherwise it is set to NSML. 

The first cell VOBU start address C_FVOBU_SA_reg stores the VOBU start address of the first cell in a block. The 
value of this address is expressed as the distance from the logic sector of the first cell in the VTS title VOBS 
(VTSTT_VOBS) as measured by and expressed (stored) as the number of sectors. 

The last cell VOBU start address C_LVOBU_SA_reg stores the VOBU start address of the last cell in the block. The 
10 value of this address is also expressed as the distance from the logic sector of the first cell in the VTS title VOBS 
(VTSTT_VOBS) measured by and expressed (stored) as the number of sectors. 

The decoding table shown in Fig. 63 is described below. As shown in Fig. 63, the decoding table comprises the fol- 
lowing registers: information registers for non-seamless multi-angle control, information registers for seamless multi- 
angle control, a VOBU information register, and information registers for seamless reproduction. 
15 The information registers for non-seamless multi-angle control comprise sub-registers NSML_AGL_C1_DSTA_reg 
- N SM L_AGL_C9_D STA_r eg . 

NSML_AGL__C1_DSTA_reg - NSM L_AG L_C9_DSTA_r eg record the NMS L_AGL_C 1 _DSTA 
NMSL_AGL_C9_DSTA values in the PCI packet shown in Fig. 20. 

The information registers for seamless multi-angle control comprise sub-registers SM L_ AG L_C 1 _D STA__r eg - 
20 SM L_AG L_C9_D STA__r eg . 

SML_AGL_C1 _DSTA_reg - SM L_AGL_C9_D STA_reg record the SMLAGL_C1_DSTA - SML_AGL_C9_DSTA 

values in the DSI packet shown in Fig. 20. 

The VOBU information register stores the end pack address VOBU_EA in the DSI packet shown in Fig. 20. 

The information registers for seamless reproduction comprise the following sub-registers: an interleaved unit flag 
25 ILVU_flag_reg, Unit END flag UNIT_ENDJIag_reg, Interleaved Unit End Address ILVU_EA_reg, Next Interleaved Unit 
Start Address NTJLVU_SA_reg, the presentation start time of the first video frame in the VOB (Initial Video Frame 
Presentation Start Time) VOB_V_SPTM_reg, the presentation end time of the last video frame in the VOB (Final Video 
Frame Presentation Termination Time) VOB_V_EPTM_/eg, audio reproduction stopping time 1 
VOB_A_STP_PTM1_reg, audio reproduction stopping time 2 VOB_A_STP_PTM2_reg, audio reproduction stopping 
30 period 1 VOB_A_GAP_J-E N 1 _reg , and audio reproduction stopping period 2 VOB_A_GAP_LEN2_reg. 

The interleaved unit flag ILVU_flag_reg stores the value indicating whether the video object unit VOBU is in an 
interleaved block, and stores ILVU if it is. and NJLVU if not. 

The Unit END flag UN1T_ENDJIag_reg stores the value indicating whether the video object unit VOBU is the last 
VOBU in the interleaved unit ILVU. Because the interleaved unit ILVU is the data unit for continuous reading, the 
35 UNIT_ENDJIag_reg stores END if the VOBU currently being read is the last VOBU in the interleaved unit ILVU. and 
otherwise stores N_END. 

The Interleaved Unit End Address ILVU_EA_reg stores the address of the last pack in the ILVU to which the VOBU 
belongs if the VOBU is in an interleaved block. This address is expressed as the number of sectors from the navigation 
pack NV of that VOBU. 

40 The Next Interleaved Unit Start Address NTJLVU_SA_reg stores the start address of the next interleaved unit ILVU 
if the VOBU is in an interleaved block. This address is also expressed as the number of sectors from the navigation pack 
NV of that VOBU. 

The Initial Video Frame presentation Start Time register VOB_V_SPTM_reg stores the time at which presentation 
of the first video frame in the VOB starts. 
45 The Final Video Frame Presentation Termination Time register VOB_V_EPTM__reg stores the time at which pres- 
entation of the last video frame in the VOB ends. 

The audio reproduction stopping time 1 VOB_A_STP_PTM 1_reg stores the time at which the audio is to be paused 
to enable ^synchronization, and the audio reproduction stopping period 1 VO B_A_GAP_LE N 1 _reg stores the length 
of this pause period. 

so The audio reproduction stopping time 2 VOB_A_STP_PTM2_reg and audio reproduction stopping period 2 
VO B_A_GAP_LE N2_reg store the same values. 

The operation of the DVD decoder DCD according to the present invention as shown in Fig. 26 is described next 
below with reference to the flow chart in Rg. 69. 

At step #310202 it is first determined whether a disk has been inserted. If it has, the procedure moves to step 

55 #310204. 

At step #310204, the volume file structure VFS (Rg. 21) is read, and the procedure moves to step #310206. 
At step #310206. the video manager VMG (Fig. 21) is read and the video title set VTS to be reproduced is 
extracted. The procedure then moves to step #310208. 
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At step #310208. the video title set menu address information VTSM_C_ADT is extracted from the VTS information 
^CSSSIS: r^Zu^SM.VOBS is read from the disk based on the v«eo - set menu 

titles^ no user-selectable content, and titles containing audio numbers, sub-picture numbers. °r mult.ar^ e s^ne 
conte^ the usi m ust also enter the desired angle number. Once the user se.ect.on ,s completed, the procedure 

m ° V S VTS PGC. #i program chain (PGC) data block corresponding to the «• ~rntan selected by 

-™^f== 

finish^ Sfe dtoding process ends. If a separate ««e is thereafter to be ^produced as determined by monrfonng Key 

oJ 'SpGC ^rep^uction routine consists of steps #31030. #31032. #31034. and #31035 as shown. 

At sSp #?i03?the decoding system table shown in Fig. 62 is defined. The number ANG^ NO reg VTS 
purser WS NO ?eg PGC numb* VTS_PGC._NO_reg. audio ID AUD.OJD reg, and sub-p,cture ID SPJD_reg are 

STC resetting flag STCDF. seamless angle change flag SACF_reg. f.rst cell VOBU start address C_FVOBU_SA_reg. 
and last cell VOBU start address C_LVOBU_SA_reg. etrMm /^po #310321 and 

Once the decoding system table is defined, the process transfernng data te .the ^sfream buffer (step #31032) and 

the process for decoding and reproducing the data stored to the stream buffer 2400. 

Nn) . that sten #31032 and step #31034 are executed in parallel. . 

Note that step hiuk : b h processing one cell is completed, rt is determined in step 

SSSS=S=.™ K3SS 

Sh Tt s^UToS it is determined whether the ce.l is a mufti-angle ce... ff not. the procedure moves to step #30144. 

^ef^*^ ce... the procedure moves to step #30142 

where^amfeS a^-ge f!ag SACF is eva.uated to determine whether seam.ess ang.e reproduction ,s spec- 
if seamless angle reproduction is specified, the seamless multi-angle decoding process is executed in step 
^ ^S^rZ^on is not specified, the non-seam.ess mutti-ang.e decodmg process ,s executed ,n 

SteP rnon-mu W -ang.e ce.. decoding process (step #31044. ^^^^^ToTS^ aS 
. 71. Note that the non-multi-angle cell decoding process (step #31044) compnses the steps #31050. #3io^. an 

^efirst step #31050 evaluates the interleaved a.location f lag ^^^SSS^ ^ " " 

leaved block. If it is the non-multi-angle interleaved block process is executed in step #31052. 

The non-multi-angle interleaved block process (step #31052) processes scene branching and connect.cn where 

; seam reT^^^ 



50 



EP0 847 200 A1 



Fig. 72. 

At step #31060 the reading head 2006 is jumped to the first cell VOBU start address C_FVOBU_SA read from the 
C_FVOBU_SA_reg register. 

More specifically, the address data C_FVOBU_SA_reg stored in the decoding system controller 2300 (Fig. 26) is 
5 input as bitstream reproduction control signal St53 to the reproduction controller 2002. The reproduction controller 2002 
thus controls the recording media drive unit 2004 and signal processor 2008 to move the reading head 2006 to the 
specified address, data is read, error correction code ECC and other signal processing is accomplished by the signal 
processor 2008. and the cell start VOBU data is output as the reproduced bitstream St61 to the stream buffer 2400. The 
procedure then moves to step #31062. 
10 At step #31062 the DSI packet data in the navigation pack NV (Fig. 20) is extracted in the stream buffer 2400, the 
decoding table is defined, and the procedure moves to step #31064. The registers set in the decoding table are the 
ILVU_EA_reg, NT_I LVU_SA_reg , VOB_V_SPTM_reg, VOB_V_EPTM_reg, VOB_A_STP_PTM1_reg, 
VOB_A_STP_PTM2_reg, VO B_A_G AP_LE N 1 _r eg , and VOB_A_GAP_LEN2_reg. 

At step #31064 the data from the first cell VOBU start address C_FVOBU_SA_reg to the ILVU end pack address 
75 ILVU_EA_jeg. i.e., the data for one interleaved unit ILVU, is transferred to the stream buffer 2400. The procedure then 
moves to step #31 066. 

More specifically, the address data ILVU_EA_reg stored in the decoding system controller 2300 (Fig. 26) is sup- 
plied to the reproduction controller 2002. The reproduction controller 2002 thus controls the recording media drive unit 
2004 and signal processor 2008 to read the data to the ILVU_EA_reg address, and after error correction code ECC and 
20 other signal processing is accomplished by the signal processor 2008, the data for the first ILVU in the cell is output as 
the reproduced bitstream St61 to the stream buffer 2400. It is thus possible to output the data for one contiguous inter- 
leaved unit ILVU on the recording medium M to the stream buffer 2400. 

At step #31066 it is determined whether ail interleaved units in the interleaved block have been read and trans- 
ferred. If the interleaved unit ILVU processed is the last ILVU in the interleaved block, "0x7FFFFFFF M indicating termi- 
25 nation is set to the next-ILVU start address NTJLVU_SA_reg as the next read address. If all interleaved units in the 
interleaved block have thus been processed, the procedure moves to step #31068. 

At step #31068 the reading head 2006 is again jumped to the address NTJLVU_SA_reg of the next interleave unit 
to be reproduced, and the procedure loops back to step #31062. Note that this jump is also accomplished as described 
above, and the loop from step #31 062 to step #31 068 is repeated. 
30 However, if step #31066 returns YES, i.e., all interleaved unit ILVU in the interleaved block have been transferred, 
step #31052 terminates. 

The non-multi-angle interleaved block process (step #31 052) thus transfers the data of one cell to the stream buffer 
2400. 

The non-multi-angle contiguous block process is executed in step #31054, Fig. 71 , is described further below with 
35 reference to Fig. 73. 

At step #31070 the reading head 2006 is jumped to the first cell VOBU start address C_FVOBU_SA read from the 
C_FVOBU_SA_reg register. This jump is also accomplished as described above, and the loop from step #31072 to step 
#31076 is initiated. 

At step #31072 the DSI packet data in the navigation pack NV (Fig. 20) is extracted in the stream buffer 2400, the 
40 decoding table is defined, and the procedure moves to step #31074. The registers set in the decoding table are the 
VOBU_EA_reg, VOB_V_SPTMjreg, VOB_V_EPTM_reg, VOB_A_STP_PTM1_reg. VOB_A_STP_PTM2_reg. 
VOB_A_G AP_LE N 1 _reg , and VOB_A_GAP_LE N2_reg . 

At step #31074 the data from the first ceil VOBU start address C_FVOBU_SA_reg to the end pack address 
VOBU_EA_jeg, i.e., the data for one video object unit VOBU, is transferred to the stream buffer 2400. The procedure 
45 then moves to step #31076. The data for one video object unit VOBU contiguously arrayed to the recording medium M 
can thus be transferred to the stream buffer 2400. 

At step #31 076 it is determined whether all cell data has been transferred. If all VOBU in the cell has not been trans- 
ferred, the data for the next VOBU is read continuously, and the process loops back to step #31070. 

However, if all VOBU data in the cell has been transferred as determined by the C_LVOBU_SA_reg value in step 
so #31076. the non-multi-angle contiguous block process (step #31054) terminates. This process thus transfers the data 
of one cell to the stream buffer 2400. 

Another method of accomplishing the non-multi-angle cell decoding process (step #31044, Fig. 70) is described 
below with reference to Fig. 74. 

At step #31080 the reading head 2006 is jumped to the first cell VOBU start address C_FVOBU_SA_reg, and the 
55 first VOBU data in the cell is transferred to the stream buffer 2400. The procedure then moves to step #31081 . 

At step #31081 the DSI packet data in the navigation pack NV (Fig. 20) is extracted in the stream buffer 2400, the 
decoding table is defined, and the procedure moves to step #31082. The registers set in the decoding table are the 
SCR_buffer, VOBU_EA_reg. ILVU_flagL_reg, UNIT_ENDJIag_reg. ILVU_EA_reg, NT_ILVU_SA_reg, 
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VOB V SPm reg. VOB.VJEPTM.reg, VOB_A_STP_PTM1_reg. VOB_A_STP_PTM2_reg. 

^ address C_FVOBU_SA_reg to the end pa* address 
VOBU STrS. E £ ? <2£ £ one ^dlo object unit VOBU, is transferred to the stream butter 2400. The procedure 

then A 7s^£l1»S?deSmined whether a., cel. VOBU data has been transferred. If it has. the process (step #31044) 

badTfo slep #31081 . If so. the procedure advances to step #31085. It is thus poss,ble to transfer one cel. of data ,n 
VOBU units to the stream buffer 2400. , ^ 

- in - — lf - step 

5 f h^i to So #31081 It is thus possible to transfer the data for one cell to the stream buffer 2400. 
^ SS^SSSS^^ executed in step #30146. Rg. 70. is described below referring to Rg. 

75 At sien #31 090 the reading head 2006 is jumped to the first cell VOBU start address C_FVOBU_SA read from the 
C FVOBU £ ™ Ser and I the first VOBU data in the cel. is transferred to the stream buffer 2400. ^procedure 
fhTrnSesfc i^mm jump is also accomplished as described above, and the .oop from step #31 091 to step 

^TSlISa 2 -^ the ANGLE NO reg is updated, and the procedure moves to step #31094. This update operation 
rJH SSJSS ^ Z Ttolhe angle' nurTer of the angle selected by the user when the user changes the ang.e 

Ser^^SlOl - This jump is also accomplished as described above, and the loop from step #31 101 to step 
"^liSSl the DS. packet data in the navigation pack NV (Rg. 20) is extracted in tte stream buffer MM 

S£ A^P™ S^SoB A^i^^hBjid^JJBl^ and VOB_A_GAP_LEN2_reg. 

V£ «l^hh?date fTom the'first cell VOBU start address C_FUOBU_SA_reg to the end pack address 
VOBU iTrS e the £ai tor one VOBU. is transferred to the stream buffer 2400. The procedure then moves to step 
«1 1^Sus poss^Io output the data for one contiguous video object unit VOBU on the record.ng med.um M to 

^ t^^nrfaTe ANGLE NO reg is updated, and the procedure moves to step #31 104. This update operation 
, rese^Sr^GLE ^ ^e angSnunSr of the ang.e selected by the user when the user changes the angle 

^^r^L^ZiZ^^er the angle cell data has all been transferred. H all VOBU in the cel. have not 
been tra^e^ the proceduTe moves to step #31105. If all VOBU in the cel. have been transferred, the process te, 
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minates. 

At step #31105 the reading head 2006 is jumped to the next angle (NSML_AGL_C#n_reg). and the process 
advances to step #31 106. Note that NSML_AGL_C#n_reg is the address of the angle to which the ANGLE_NO_reg 
was updated in step #31 1 03. 

5 It is thus possible to transfer the data for the angle selected by the user to the stream buffer 2400 in VOBU units. 

Step #31 106 is an effective step for high speed angle switching, and simply clears the stream buffer 2400. By thus 
clearing the stream buffer 2400 the data for the newly selected angle can be reproduced without reproducing the angle 
data that is still not decoded. In other words, clearing the stream buffer 2400 enables faster response to user opera- 
tions. 

w ft is very important that DVD decoder according to the present invention can promptly moves to the next data read- 
ing process and effectively performs the data reading once after the detection of the end of data such as interleave unit 
ILVU and video object unit VOBU for the sake of seamless reproduction which is one of main targets of the present 
invention. 

With reference to Fig. 66, a construction of the stream buffer 2400 which can performs the end detection of inter- 
15 leave unit ILVU is described briefly. The stream buffer 2400 comprises a VOB buffer 2402, a system buffer 2404, a nav- 
igation pack extractor 2406, and a data counter 2408. The system buffer 2404 temporarily stores the title control data 
VTSI(Fig. 16) included in signal St61 , and outputs a control information St2450 (St63) such as VTS_PGC. 

The VOB buffer 2402 temporarily stores the title VOB data VTSTT_VOB (Fig. 1002), and the stream St67 to the 
system decoder 2500. 

20 The NV (navigation pack) extractor 2406 receives the VOB data at the same time with the VOB buffer 2402, and 
extracts the navigation pack NV therefrom. The NV extractor 2406 furthermore extracts the VOBU final pack address 
COBU_EA or ILVU final pack address ILVU_EA which are the DSI generation information DSM3I shown in Fig. 19 to 
produce a pack address information St2452 (St63). 

The data counter 2408 receives the VOB data at the same time with the VOB buffer 2402, and counts each of pack 

25 data shown in Fig. 1 9 byte by bye. Then, the data counter 2408 produces a pack input terminating signal St2454 (St63) 
at the time when the inputting of pack data is completed. 

Due to its construction shown in Fig. 66, the stream buffer 2400 performs the VOBU data transfer as examples at 
step #31064 of Fig. 72, as follows. The stream buffer 2400 outputs the VOBU data for the NV extractor 2406 and data 
counter 2408 at the same time when the VOBU buffer 2400 receives the VOBI data on the top of interleave unit ILVU. 

30 As a result, the NV extractor 2406 can extracts the data of ILVU_EA and NT_ILVU_SA at the same time with the input- 
ting of navigation pack data NV, and outputs thereof as signal St2452 (St63) to the decode system controller 2300 (Fig. 
26). 

The decode system controller 2300 stores the signal St2452 into the ILVU_EA_reg and NT_ILVU_SA_reg, and 
then start to counts the number of packs based on the pack terminating signal 2452 from the data counter 2408. Based 

35 on the fore mentioned the counted value of packs and ILVU_EA_reg, the decode system controller 2300 detects the 
instance when the inputting of final pack data of ILVU is completed, or the inputting final byte data of the final pack of 
the ILVU is completed. Then, the controller 2300 further give a command for the bitstream reproducer 2000 to move to 
the position having a sector address indicated by NTJLVU_SA_reg. The bitstream producer 2000 moves to the sector 
address indicated NT_ILVU_SA_reg, and starts to read the data. Thus, the detection of final end of ILVU and reading 

40 process for the next ILVU can be performed effectively. 

In the above, an example where the multimedia data MBS is reproduced by the bitstream reproducer 2000 without 
a buffering process, and is inputted to the stream buffer 2499. However, in the case that the signal processor 2008 of 
the bitstream reproducer 2000 is incorporated with a buffer for error correction process, for example, the controller 2300 
gives a moving command to reproducer 2000 so that the reproducer 2000 moves to the reading position indicated by 

45 NT_ILVU_SA_reg after completion of the final pack data of fore mentioned ILVU and clearing the internal buffer of the 
reproducer 2000. Thus, the effective reproduction of ILVU data even when the bitstream reproducer 2000 includes a 
buffer for error correction code (ECC) process. 

Furthermore, when the bitstream producer 2000 has a buffer for ECC process, the data can be transferred effec- 
tively by providing any means having a function equivalent to that of data counter 2408 (Fig. 66). In other words, the 

so bitstream reproducer 2000 generates the pack input completion signal St62; the decode system controller 2300 gives 
a command based on the signal St62 the bitstream reproducer 200 to move to the reading position having sector 
address designated by NT_ILVU_SA_reg. As apparent from the above, the data can be transferred effectively even 
when the bitstream reproducer 2000 has a function to buffer the data reproduced from the recording media M. 

It is to be noted that the apparatus and method substantially the same as those described in the above with respect 

55 to the interleave unit ILVU can be used for the detection VOBU end. In other words, by replacing the extraction of 
ILVU_EA and NT_ILVU_Sa. and the storing of ILVU_EA_reg and NT_ILVU_SA_reg with the extraction of VOBU_EA 
and storing VOBU_EA_reg, the apparatus and method according to the present invention, described above, can be 
used for the detection of an_@end. This is effective for the VOBU data transferring operations shown at steps #31074, 
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#31082, #31092, and #31102. 

Thus, the reading of data such as ILVU and VOBU can be performed effectively. 

Decoding flows in the stream buffer 

5 

The process for decoding data in the stream buffer 2400 shown as step #31 034 in Fig. 64 is described below refer- 
ring to Fig. 67. This process (step #31034) comprises steps #31 1 10, #31 1 12. #31114, and #31 1 16. 

At step #31110 data is transferred in pack units from the stream buffer 2400 to the system decoder 2500 (Fig. 26). 
The procedure then moves to step #31112. 
10 At step #31 112 the pack data is from the stream buffer 2400 to each of the buffers, i.e., the video buffer 2600, sub- 
picture buffer 2700, and audio buffer 2800. 

At step #31 1 12 the Ids of the user-selected audio and sub-picture data, i.e., the audio ID AUDIO_ID_reg and the 
sub-Picture ID SP_ID_reg stored to the scenario information register shown in Fig. 62, are compared with the stream 
ID and sub-stream ID read from the packet header (Fig. 1 9), and the matching packets are output to the respective buff- 
is ers. The procedure then moves to step #31114. 

The decode timing of the respective decoders (video, sub-picture, and audio decoders) is controlled in step 
#31 1 14, i.e., the decoding operations of the decoders are synchronized, and the procedure moves to step #31 116. 

Note that the decoder synchronization process of step #31 1 14 is described below with reference to Fig. 68. 

The respective elementary strings are then decoded at step #31116. The video decoder 3801 thus reads and 
20 decodes the data from the video buffer, the sub-picture decoder 31 00 reads and decodes the data from the sub-picture 
buffer, and the audio decoder 3200 reads and decodes the data from the audio buffer. 

This stream buffer data decoding process then terminates when these decoding processes are completed. 

The decoder synchronization process of step #31 1 14, Fig. 67, is described below with reference to Fig. 68. This 
processes comprises steps #31 120, #31 122, and #31 124. 
25 At step #31 120 it is determined whether a seamless connection is specified between the current cell and the pre- 
ceding cell. If a seamless connection, the procedure moves to step #31 122, if not. the procedure moves to step #31 1 24. 

A process synchronizing operation for producing seamless connections is executed in step #31 122, and a process 
synchronizing operation for non-seamless connections is executed in step #31 124. 

A multi-angle system comprising plural system streams each comprising audio and video data representing views 
30 from different perspectives is thus constructed by means of the present invention described above. Using the resulting 
multi-angle system stream, an observer can during media presentation dynamically and freely switch the system 
streams representing each of the angles at predefined unit intervals. To accomplish this, the system streams are con- 
structed such that the presentation times of the video data and the presentation times of the audio data contained in 
the system streams corresponding to each angle are the same in each of the particular units at which switching 
35 between angles is possible. As a result, when the user changes from one angle to another angle at any desired point 
within a multi-angle scene period as though changing the camera position, the video will not be disturbed or intermitted, 
the audio will not be disrupted with noise or intermitted, and the audio and video can thus be smoothly switched. 

Moreover, because the audio data contained in the system streams corresponding to each angle is the same in the 
angle switching units of each angle, the audio will not be disrupted with noise or intermitted, and the audio can thus be 
40 smoothly switched when the user changes from one angle to another angle at any desired point within a multi-angle 
scene period as though changing the camera position. 

Industrial Applicability 

is As apparent from the above, a method and an apparatus, according to the present invention, for interleaving a bit 
stream and recording the interleaved bitstream on a recording medium and reproducing the recorded interleaved bit- 
stream therefrom is suitable for an authoring system which can generate a new title by editing a title constructed by bit- 
streams carrying various information in accordance with the user's request, and is also suitable for a recently developed 
Digital Video System, or DVD system. 

so 

Claims 

1. An interleaving method which for the presentation of a bitstream that is reproduced by selecting two or more data 
units (VOB) from a bitstream comprising three or more data units (VOB) contiguous on the same time-base is char- 
15 acterized by generating said bitstream by arranging the selected data units (VOB) in a particular sequence on the 
same time-base based on the presentation time of each data unit (VOB) so that it is posstole to sequentially access 
all data units (VOB) and present only the selected data units (VOB) without time-base intermittence, and is charac- 
terized by dividing said data units (VOB) into shortest-read -time data units (ILVU), and generating said bitstream 
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by arranging the shortest-read-time data units (ILVU) in a particular sequence on the same time-base based on the 
shortest read time each shortest-read-time data unit (ILVU) so that it is possible to sequentially access all shortest- 
read-time data units (ILVU) and present only the shortest-read-time data units (ILVU) of the selected data units 
(VOB) without time-base intermittence, said presentation time of each shortest-read-time data unit (ILVU) being the 
5 same. 

2. An interleaving method according to Claim 1 wherein the shortest-read-time data units (ILVU) are the same audio 
data. 

10 3. An interleaving method according to Claim 2 wherein the shortest-read-time data units (ILVU) are completely self- 
contained data units, the presentation of which does not require referencing any other shortest-read-time data unit 
(ILVU). 

4. An interleaving method according to Claim 2 wherein the shortest-read-time data units (ILVU) are closed 
15 group_of_pictures GOP based on the MPEG standard. 

5. An interleaving method which for the presentation of a bitstream that is reproduced by selecting two or more data 
units (VOB) from a bitstream comprising three or more data units (VOB) contiguous on the same time-base is char- 
acterized by generating said bitstream by arranging the selected data units (VOB) in a particular sequence on the 

20 same time-base based on the presentation time of each data unit (VOB) so that it is possible to access and present 
without time-base intermittence only the selected data units (VOB), and is characterized by dividing said data units 
(VOB) into shortest-read-time data units (ILVU), and generating said bitstream by arranging the shortest-read-time 
data units (ILVU) in a particular sequence on the same time-base based on the read time of each shortest-read- 
time data unit (ILVU) so that it is possible to access and present without time-base intermittence only the shortest- 

25 read-time data units (ILVU) of the selected data units (VOB), said presentation time of each shortest-read-time data 
unit (ILVU) being the same. 

6. An interleaving method according to Claim 5 wherein the shortest-read-time data units (ILVU) are the same audio 
data. 

30 

7. An interleaving method according to Claim 5 wherein the shortest-read -time data units (ILVU) are completely self- 
contained data units, the presentation of which does not require referencing any other shortest-read-time data unit 
(ILVU). 

35 8. An interleaving method according to Claim 5 wherein the shortest-read-time data units (ILVU) are closed 
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